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ABSTRACT 


This  thesis  is  concerned  with  the  use  of  the  digital 
computer  to  realize  cryptography.  Three  cryptographic 
systems:  simple  substitution,  pseudo-random  cipher 

(polyalphabetic  cipher) , and  data-keyed  cipher,  are 
designed,  implemented  through  computer  programming,  and 
evaluated.  A suitable  cyclic  error  correcting  code  is 
designed  to  encode  these  systems  for  transmission.  The 
code  is  tested  by  simulating  a noisy  channel. 


TABLE  OF  CONTENTS 


I.  DEFINITIONS 10 

II.  INTRODUCTION 12 

III.  HISTORICAL  BACKGROUND 14 

IV.  THEORY  OF  SECRECY  SYSTEMS 20 

A.  INTRODUCTION 20 

B.  EVALUATION  OF  SECRECY  SYSTEMS  20 

C.  PERFECT  SECRECY 21 

D.  EQUIVOCATION 26 

E.  IDEAL  SECRECY  SYSTEMS  27 

V.  DIGITAL  SUBSTITUTION  30 

A.  THE  DECWRITER  SYSTEM 30 

B.  APPLICATION  OF  GROUP  THEORY 

TO  CRYPTOGRAFHY 33 

C.  TRANSFORMATIONS 36 

D.  SIMPLE  SUBSTITUTION  40 

E.  GRAPHICAL  REPRESENTATION  OF  RESULTS  41 

F.  PSEUDORANDOM  SUBSTITUTION  46 

VI.  THE  DATA-KEYED  CIPHER 64 

A.  INTRODUCTION 64 

B.  DESCRIPTION  AND  REALIZATION 64 

C.  TEST  PROCEDURE 67 

D.  RESULTS 71 

E.  COMMUNICATION  SYSTEM  DEGRADATION  73 

VII.  ERROR  CORRECTING  CODE  SCHEME  87 

A.  BEST  CODE  DETERMINATION 8 8 


5 


J.J  *^fZf**W  l' tu  ■! UiV^.^.'.'i  !VM!  IU  ><**  i1  L'V1^ ;*UJ  !J^- 1 Vf-  V W 


B.  THE  (15,4)  CYCLIC  CODE  AND  ITS 

COMPUTER  IMPLEMENTATION  91 

1.  Selection  of  Polynomial 91 

2.  Computer  Realization  of  Encoder  94 

3.  Minimum  Distance  Decoder  95 

C.  NOISY  CHANNEL  SIMULATION  95 

VIII.  SUMMARY  AND  CONCLUSIONS  101 

APPENDIX  A 103 

APPENDIX  B 108 

APPENDIX  C 109 

APPENDIX  D 110 

APPENDIX  E 117 

APPENDIX  F 118 

APPENDIX  G 120 

LIST  OF  REFERENCES 122 

INITIAL  DISTRIBUTION  LIST  124 


6 


LIST  OF  FIGURES 


1.  A SECRECY  SYSTEM 21 

2.  BLOCK  DIAGRAM  OF  THE  SIMPLE  SUBSTITUTION  CIPHER 42 

3.  SIMPLE  SUBSTITUTION  CIPHER-ENCRYPTING  EXAMPLE 43 

4.  PLAINTEXT  ENGLISH  LANGUAGE -DISTRIBUTION  PLOT 47 

5.  PLAINTEXT  ITALIAN  LANGUAGE -DISTRIBUTION  PLOT 48 

6.  PLAINTEXT  SPANISH  LANGUAGE -DISTRIBUTION  PLOT 49 

7.  PLAINTEXT  FRENCH  LANGUAGE -DISTRIBUTION  PLOT 50 

8.  SIMPLE  SUBSTITUTION -DISTRIBUTION  PLOT-KEY=A  51 

9.  SIMPLE  SUBSTITUTION  DISTRIBUTION  P LOT-KEY =C 52 

10.  SIMPLE  SUBSTITUTION  DISTRIBUTION  PLOT-KEY=G  53 

11.  PSEUDORANDOM  CIPHER-BLOCK  DIAGRAM  56 

12.  PSEUDORANDOM  CIPHER-DISTRIBUTION  PLOT-KEY=C 60 

13.  PSEUDORANDOM  CIPHER-DISTRIBUTION  PLOT-KEY=K  61 


14.  PSEUDORANDOM  CIPHER-DISTRIBUTION  PLOT-  7 ALPHABETS  “62 

15.  PSEUDORANDOM  CIPHER-DISTRIBUTION  PLOT-23  ALPHABETS  -63 


16.  THE  DATA- KEYED  CIPHER-CONCEPT * 66 

17.  THE  DATA-KEYED  CIPHER- REALIZATION  68 

18.  THE  DATA-KEYED  CIPHER-BLOCK  DIAGRAM  69 

19.  THE  DATA-KEYED  CIPHER-ENCRYPTING  PROCESS  EXAMPLE  74 

20.  THE  DATA-KEYED  CIPHER-ENCRYPTING  PROCESS  EXAMPLE  75 

21.  THE  DATA-KEYED  CIPHER-EXAMPLE  OF  TRANSIENT 

SUBSTITUTION 82 

22.  THE  DATA-KEYED  CIPHER- DISTRIBUTION  PLOT-KEY=A,  i=7  —83 

23.  THE  DATA-KEYED  CIPHER-DISTRIBUTION  PLOT-KEY-C, i=7  —84 


7 


24.  THE  DATA-KEYED  CIPHER-DISTRIBUTION  PLOT-KEY=J , i=2  85 

25.  THE  DATA-KEYED  CIPHER-DISTRIBUTION  PLOT-KEY^J, i=17  — 86 

26.  THE  4-STAGE  ENCODER  OF  THE  CHARACTERISTIC 

POLYNOMIAL  G(X)  = x4  + x + 1 93 

27.  SECURE  DIGITAL  COMMUNICATION  SYSTEM  BLOCK  DIAGRAM  98 


8 


LIST  OP  TABLES 


I.  USASCII-68  CHARACTER  CODE 31 

II.  DECWRITER  PRINTING  CHARACTERS  AND  THEIR 

BINARY  REPRESENTATION  32 

III.  INTERMEDIATE  KEY  VALUES 39 

IV.  FREQUENCY  OF  THE  LETTERS  OF  THE  ENGLISH 
ALPHABET/  ARRANGED  ALPHABETICALLY  7*ND 

BY  FREQUENCY 44 

V.  SIMPLE  SUBSTITUTION  CIPHER-TABLE  OF 

OCCURRENCES 54 

VI.  DATA-KEYED  CIPHER-TABLE  OF  OCCURRENCES  76 

VII.  DATA-KEYED  CIPHER-TABLE  OF  OCCURRENCES  77 

VIII.  TABLE  OF  MESSAGE  WORDS  AND  THEIR  CORRES- 
PONDENT CODE  WORD  FOR  THE  (15/4)  CYCLIC 

CODE 96 

IX.  P (e)  VS.  CHANNEL  8 FOR  THE  (15,4)  CODE 97 


I . DEFINITIONS 


The  following  definitions  are  given  to  acquaint  the 
reader  with  some  of  the  terms  commonly  encountered  in  the 
field  of  cryptography. 

Cryptology  is  the  branch  of  knowledge  that  deals  with 
the  development  and  use  of  all  forms  of  secret  communication. 

Cryptography  is  the  branch  of  cryptology  that  deals 
with  secret  writing. 

Cryptanlaysis  is  uhe  branch  of  cryptology  that  deals 
with  the  analysis  and  solution  of  cryptographic  systems. 

A Cipher  is  a cryptographic  system  which  conceals, 
in  a cryptographic  sense,  the  letters  or  groups  of  letters 
in  the  message  or  plaintext. 

Enciphering  is  the  operation  of  concealing  a plaintext, 
and  th^  result  is  a cipher  text,  or  in  general  a cryptogram. 

Deciphering  is  the  process  of  discovering  the  secret 
meaning  of  a cipher  text. 

A key  is  the  variable  parameter  of  a cipher  system, 
prearranged  between  correspondents,  which  determines  the 
specific  application  of  a general  cipher  system  being 
used.  The  use  of  keys  permits  almost  endless  variations 
within  a given  cipher  system.  In  fact,  the  value  of  a 
specific  cipher  system  is  based  on  how  hard  it  is  for  an 
"enemy"  to  break  a cryptogram  or  series  of  cryptograms, 
assuming  he  knows  the  complete  details  of  the  system  but 


10 


lacks  the  keys  which  were  used  to  encipher  the  cryptograms 
originally . 

A code  is  a cryptographic  system  which  substitutes 
symbol  groups  for  words,  phrases,  or  sentences  found  in 
the  plaintext.  It  involves  the  use  of  a codebook,  copies 
of  which  are  kept  by  each  correspondent. 

Encoding  is  the  operation  of  concealing  a message 
using  a code. 

Decoding  is  the  process  of  recovering  an  encoded 
message . 

A code  differs  from  a cipher  because  a code  deals  with 
plaintext  in  variable  size  units,  such  as  words  or  phrases, 
while  a cipher  deals  with  plaintext  in  fixed  size  units, 
usually  a letter  at  a time. 


U ..'.■.y|.»l..".«(1il.ll!»J^jS_M|..JPJJIlH.,W^  ilJW 


II.  INTRODUCTION 

Since  there  is  no  way  of  making  data  communication 
links  physically  secure,  particularly  if  some  form  of  radio 
transmission  is  involved,  encryption  is  the  only  practical 
method  of  protecting  the  transmitted  data.  In  the  commer- 
cial world  and  nonmilitary  parts  of  government,  there  is  a 
growing  need  for  encryption.  This  need  for  encryption  is 
not  just  to  satisfy  the  legal  requirements  for  privacy, 
but  also  to  protect  systems  from  criminal  activities. 

At  the  present  time,  communication  systems  seem  to  be 
going  towards  digital  means.  There  are  already  in  use 
digital  systems  for  data  communications  as  well  as  for 
public  services  such  as  the  telephone  system. 

The  present  work  was  intended  to  study  the  possibility 
of  using  a digital  computer  to  realize  cryptographic  systems. 
Further,  this  computer  can  be  envisioned  as  part  of  a digital 
communication  system,  mainly  to  do  cryptography  and  to 
implement  suitable  error  correcting  codes.  The  DEC 
PDP-11/40  minicomputer  was  used  to  do  this  study. 

Through  this  work,  three  cryptographic  systems  were 
designed,  ranging  from  a simple  substitution  cipher  to  a 
data-keyed  cipher.  On  the  latter  the  message  itself  con- 
stituted the  key  to  modify  other  characters.  Very  signi- 
ficant results  were  obtained  from  it  in  the  sense  that  it 
gives  rise  to  a text  where  its  characters  were  nearly 
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equiprobable.  Further,  a cyclic  error  correcting  code 
was  designed  and  implemented  to  work  with  these 
cryptographic  systems. 
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III.  HISTORICAL  BACKGROUND 


Some  of  the  earliest  practical  crytographic  systems 
were  the  monoalphabetic  substitution  systems  used  by  the 
Romans  [Ref.  1].  In  these,  one  letter  is  substituted  for 
another.  For  example,  an  A might  be  replaced  by  a C. 

By  the  fifteenth  century,  an  Italian  by  the  name  of  Alberti 
came  up  with  a technique  of  cryptoanalyzing  letters  by 
frequency  analyses.  As  a result,  he  invented  probably  the 
first  polyalphabetic  substitution  system  using  a cipher 
disk.  Thus,  he  would  rotate  the  disk  and  encode  several 
more  words  with  the  next  substitution  alphabet. 

Early  in  the  sixteenth  century  Trithemius,  a Benedic- 
tine Monk,  had  the  first  printed  book  published  on  cryp- 
tology. Trithemius  described  the  square  table  or  tableau 
which  was  the  first  known  instance  of  a progressive  key 
applied  to  polyalphabetic  substitution.  It  provided  a 
means  of  changing  alphabets  with  each  character.  Later  in 
the  sixteenth  century,  Vigenere  perfected  the  autokey;  a 
progressive  key  in  which  the  last  decoded  character  led  to 
the  next  substitution  alphabet  in  a polyalphabetic  key. 

These  were  basically  the  techniques  that  were  widely  applied 
in  the  cryptomachines  in  the  first  half  of  the  twentieth 
century.  Various  transposition  techniques  have  been  em- 
ployed including  the  wide  use  of  changing  word  order  and 
techniques  such  as  rail  transpositions  (used  in  the  Civil 
War)  . 
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| In  1883,  Auguste  Kerckhoffs,  a man  born  in  Holland  but 

f 

a naturalized  Frenchman,  published  a book  entitled  La 

\ 

Cryptographic  Militaire.  In  it,  he  established  two  general 
principles  for  cryptographic  systems.  They  were: 

1.  A key  must  withstand  the  operational  strains  of 
heavy  traffic.  It  must  be  assumed  that  the  enemy 
has  the  general  system.  Therefore,  the  security 
of  the  system  must  rest  with  the  key. 

2.  Only  cryptoanalysts  can  know  the  security  of  the 
key.  In  this,  he  infers  that  anyone  who  proposes  a 
cryptographic  technique  should  be  familiar  with 
the  techniques  that  could  be  used  to  break  it. 

From  these  two  general  principles,  six  specific  require- 
ments emerged  in  his  book: 

1.  The  Key  should  be,  if  not  theoretically  unbreakable, 

i 

at  least  unbreakable  in  practice. 

2.  Compromise  of  the  nardware  system  or  coding  tech- 
nique should  not  result  in  compromising  the  security 

> of  communications  that  the  system  carries. 

* 

3.  The  key  should  be  remembered  without  notes  and 

> should  be  easily  changeable. 

4.  The  cryptograms  must  be  transmittable  by  telegraph. 
Today  this  would  be  expanded  to  include  both  digital 
intelligence  and  voice  (if  voice  scramblers  are 
employed)  utilizing  either  wire  or  radio  as  the 
medium. 

'■  A 

r 4 
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5.  The  apparatus  or  documents  should  be  portable  and 
operable  by  a single  person. 

6.  The  system  should  be  easy,  neither  requiring 
knowledge  of  a long  list  of  rules  nor  involving 
mental  strain. 

In  1917  Gilbert  S.  Vernam,  a young  engineer  at  American 
Telephone  and  Telegraph  Company,  using  the  Baudot  code 
(teletype)  invented  a means  of  adding  two  characters 
(exclusive  or) . Vernam' s machine  mixed  a key  with  text 
as  illustrated  by  the  following: 


I 


4 

,1 

ri 

% 


Clear  Text  10  111  \ 

1 

Key  0 10  10  l 

j 

Coded  Character  11101 


To  derive  the  text  from  the  coded  character,  all  that  was 
required  was  the  addition  of  the  key  again  to  the  coded 
character. 


Coded  Character  11101 
Key  0 10  10 

Clear  Text  10111 


His  machines  used  a key  tape  loop  about  eight  feet  long 
which  caused  the  key  to  repeat  itself  over  a high  volume 
of  traffic.  This  allowed  cryptoanalysts  to  derive  the 
key.  William  F.  Friedman,  in  fact,  solved  cryptograms 
using  single-loop  code  tapes  but  appears  to  have  been 
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unsuccessful  when  two  code  tapes  were  used.  Major  Joseph 
Om  Mauborgne  {U.S.  Army)  then  introduced  the  one-time 
code  tape  derived  from  a random  noise  source.  This  was 
one  of  the  first  theoretically  (and  in  practice)  unbreaka- 
ble code  systems.  The  major  disadvantage  of  the  system 
was  the  enormous  amounts  of  key  required  for  high-volume 
traffic. 

During  the  1920's  and  1930!s,  the  rotor-code  machines 
having  five  and  more  rotors,  each  rotor  representing  a 
scrambling  step,  were  developed.  They  proved  relatively 
insecure,  requiring  only  high-traffic  volume  for  the 
cryptoanalyst  to  break  them.  In  fact,  the  Japanese  used 
a code-wheel-type  machine  for  their  diplomatic  communica- 
tions well  into  World  War  II.  It  was  vulnerable  to  crypto- 
analysis, and  William  F.  Friedman  and  his  group  not  only 
solved  the  code  but  reconstructed  a model  of  the  machine 
to  break  Japanese  diplomatic  correspondence.  Thus,  Presi- 
dent Roosevelt  and  others  were  aware  of  the  impending  break 
in  diplomatic  relations  with  Japan  just  prior  to  World  l 

War  II.  I 

5 

a 

The  code  wheels  (or  rotors)  were  nothing  more  than  | 

•j 

key  memories  storing  quantities  of  key  which  could  easily  j 

be  changed  by  interchanging  rotor  positions,  specifying  j 

various  start  points  for  each  rotor,  and  periodicaly  re- 
placing a set  of  rotors.  This  provided  a means  of  producing 
what  is  called  key  leverage. 
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Journal  provide  interesting  literature  on  Digital  Data 
Scramblers , 

Today,  the  most  commonly  encountered  commercial  crypto- 
system is  based  on  the  "shift  register,"  [Ref.  5].  Despite 
design  variations,  shift  registers  are  used  as  pseudorandom 
key  generators.  The  implementation  of  data  scramblers  with 
pseudorandom  sequences  using  logic  circuits  is  suggested 
by  Twigg  [Ref.  6],  and  Henrickson  [Ref.  7].  The  idea  of 
shift  register  sequences  is  well  treated  by  Golomb  [Ref.  8]. 
The  relative  weakness  of  pseudorandom  -codes  is  pointed  by 
Meyer  and  Tuchman  [Ref.  9],  from  I.B.M.  For  high  security, 
Torrieri  [Ref.  10],  and  Geffe  [Ref.  11],  introduce  the  idea 
of  using  nonlinear  as  well  as  linear  operations.  The  theory 
of  nonlinear  operations  is  also  contained  in  Ref.  8. 

Finally,  the  appearance  of  modern  high  speed  digital 
computers  has  risen  speculation  as  how  best  to  apply  its 
capabilities  since  it  is  available  for  both  cryptography 
and  cryptanalysis.  Even  the  newest  microprocessors  are 
reported  [Ref.  12],  as  being  designed  for  encription 
devices. 

A very  comprehensive  historical  exposition  with  some 
descriptive  technical  content  is  the  book  by  Kahn,  The 
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Codebreakers  [Ref.  13],  which  appeared  in  1967.  Of  special 
interests  are  the  sections  devoted  to  the  cryptographic 
agencies  of  the  major  powers,  including  the  United  States. 

For  the  interested  reader  in  the  field  of  cryptography, 
the  American  Cryptogram  Association  publishes  "The  Crypto- 
gram," a bimonthly  magazine  of  articles  and  cryptograms. 

The  hobby  of  solving  cryptograms  provides  a fascinating 
intellectual  challenge.,  Patient  analysis  and  flashes  of 
insight,  combined  with  the  enthusiasm  of  uncovering 
something  hidden,  give  cryptanalysts  an  enjoyment  which  is 
almost  unique. 
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IV.  THEORY  OF  SECRECY  SYSTEMS 


A.  INTRODUCTION 

A secrecy  system  is  defined  as  a set  of  transformations 
of  one  space  (the  set  of  possible  messages)  into  a second 
space  (the  set  of  possible  cryptograms) . Each  particular 
transformation  of  the  set  corresponds  to  enciphering  with 
a particular  key.  The  transformations  are  supposed  rever- 
sible (non-singular)  in  order  to  obtain  unique  deciphering 
when  the  key  is  known  together  with  the  specific  system 
used. 

Each  key  and  therefore  each  transformation  is  assumed 
to  have  an  a priori  probability  associated  with  it.  Simi- 
larly each  possible  message  is  assumed  to  have  an  associated 
a priori  probability  of  being  selected  for  encryption. 

These  two  represent  the  a priori  knowledge  of  the  situation 
for  a cryptoanalyst  trying  to  break  the  cipher. 

To  use  the  system  a key  is  first  selected  and  sent 
to  the  receiving  point.  The  choice  of  a key  determines  a 
particular  transformation  in  the  set  forming  the  system. 

Then  a message  is  selected  and  the  particular  transformation 
corresponding  to  the  selected  key  is  applied  to  the  message 
to  produce  a cryptogram.  This  cryptogram  is  transmitted  to 
the  receiving  point  by  a channel  where  it  can  be  intercepted 
by  an  undesired  agent.  At  ..ne  receiving  end,  the  inverse 
of  the  particular  transformation  is  applied  to  the  cryptogram 
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to  recover  the  original  message.  Figure  1 provides  the 
conceptual  idea  of  a secrecy  system. 


MESSAGE  I inessa 
SOURCE  I 


undesired 

agent 


ENCIPHERER  V EECIPHEPER 

cryptogram 


SOURCE 


Figure  1.  A Secrecy  System. 


If  the  referred  undesired  agent  intercepts  the  trans- 
mitted cryptogram  through  a channel,  he  can  calculate  from 
it  and  from  his  possibel  knowledge  of  the  system  being  used, 
the  a posteriori  probabilities  of  the  various  possible 
messages  and  keys  which  might  have  produced  this  cryptogram. 
This  set  of  a posteriori  probabilities  constitutes  his 
knowledge  of  the  key  and  message  after  the  interception. 
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The  calculation  of  the  a posteriori  probabilities  is  the 
generalized  problem  in  cryptanalysis. 


C.  PERFECT  SECRECY 

Shannon  [Ref.  14] , provides  for  concepts  such  as 
entropy,  redundancy,  equivocation  and  many  others  that  are 
helpful  for  evaluating  secrecy  systems. 

Let  us  assume  that  the  message  space  is  constituted 
by  a finite  number  of  messages  P^,  P2,  . ..,  Pn  with  an 
associated  a priori  probabilities  p(P1) , p(P2) , ...,  p(PR) 
and  that  these  messages  are  mapped  into  the  cryptogram 
space  by  the  transformation 


C-  = T.  P . 
3 i J 


The  cryptanalyst  intercepts  a particular  C^  and  can 
then  calculate  the  a posteriori  conditional  probability 
for  the  various  messages,  p(Pj./Cj),  It  seems  natural  now 
to  define  that  one  condition  for  perfect  secrecy  is  that  for 
all  C j , the  a posteriori  probabilities  of  the  messages  P 
given  that  Cj  has  been  received,  are  equal  to  their  a 
priori  probabilities,  independent  of  these  values.  Or, 
from  an  information  theory  viewpoint,  intercepting  the 
cryptogram  has  given  the  cryptanalyst  no  information  about 
the  message;  he  just  knows  that  a message  was  sent.  On 
the  other  hand,  if  this  condition  is  not  satisfied  there 
will  exist  situations  in  which  the  cryptanalyst  has  certain 
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a priori  probabilities  and  certain  choices  of  key  and 
message  thus  preventing  perfect  secrecy  to  be  achieved. 

Shannon  [Ref.  15] , gives  a theorem  stating  the  necessary 
and  sufficient  conditions  for  perfect  secrecy,  namely 

P(C/P)  = p(C) 

for  all  the  messages  (P)  and  all  the  cryptograms  (C) . 

Where 


p (C/P)  = Conditional  probability  of  crypto- 
gram C to  occur  if  message  P is 
chosen. 

p{C)  = Probability  of  obtaining  cryptogram 
C for  any  cause. 

Stated  in  other  terms,  the  total  probability  of  all 
keys  that  transform  P^  into  a given  cryptogram  C is  equal 
to  that  of  all  keys  transforming  Pj  into  the  same  C,  for 
all  P^,  Pj  and  C. 

In  the  Mathematical  Theory  of  Communications  given  by 
Reference  14,  it  was  shown  that  a convenient  measure  of 
information  was  the  entropy.  For  a set  of  events  with 
probabilities  p^,  p,,  . ..,  p , the  entropy  H is  given  by: 

H « - l p.  log  pi 

n 
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In  a secrecy  system  there  are  two  choices  involved,  that 
of  the  message  and  that  of  the  key.  We  may  measure  the 
amount  of  information  produced  when  a message  is  chosen 
by 

H(P)  = - Lp(P)  log  p(P) 

the  summation  being  over  all  possible  messages.  Similarly, 
there  is  an  uncertainty  associated  with  the  choice  of  key 
given  by 

H (K)  « - I p (K)  log  p (K) 

For  perfect  secrecy  systems  the  amount  of  information 
in  the  message  is  at  most  log  n (occurring  when  all  messages 
are  equiprobable) . This  information  can  be  concealed 
completely  only  if  the  key  uncertainty  is  at  least  log  n. 

In  a more  general  way  of  expressing  this:  There  is  a 

limit  to  what  we  can  achieve  with  a given  uncertainty  in 
key,  the  amount  of  uncertainty  we  can  introduce  into  the 
solution  cannot  be  greater  than  the  key  uncertainty. 

The  situation  gets  more  complicated  if  the  number  of 
messages  is  infinite.  For  example,  assume  that  messages 
are  generated  as  infinite  sequences  of  letters  by  a suitable 
Markoff  process.  From  the  definition,  no  finite  key  will 
give  perfect  secrecy.  We  can  suppose  then,  that  the  key 
source  generates  keys  in  the  same  manner,  that  is  as  an 


infinite  sequence  of  symbols.  Suppose  further  that  only  a 
certain  length  is  needed  to  encipher  and  decipher  a 
length  L of  message.  Let  the  logarithm  of  the  number  of 
letters  in  the  message  alphabet  be  Rp  and  that  for  the  key 
alphabet  be  R^.  Then  from  the  finite  case,  it  is  evident 
that  perfect  secrecy  requires 


Rp  Lp  i Rk  Lk 


This  type  of  perfect  secrecy  is  obtained  by  the  Vernam 
system  [Ref.  16]. 

Thus,  it  can  be  concluded  that  the  key  required  for 
perfect  secrecy  depends  on  the  total  number  of  possible 
messages.  The  disadvantage  of  perfect  systems  for  large 
correspondence  systems  such  as  for  data  communications  and 
data  retrieval  services,  is  the  equivalent  amount  of  key 
that  mu::t  be  sent. 

In  this  paper  the  requirement  for  a large  key  for  large 
messages  is  eliminated  by  designing  a self  keyed  system 
that  will  continually  originate  key  letters  based  on  several 
past  letters  that  were  already  ciphered.  Provided  enough 
distance  is  chosen  in  between  selected  letters  the  system 
will  avoid  the  statistical  dependency  of  consecutive  letters 
in  a natural  language,  thus  generating  a sequence  of  key 
letters  suitable  for  any  message  length. 
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D.  EQUIVOCATION 

A cryptographic  system  can  be  compared  with  a communi- 
cation system  in  the  sense  that  whereas  in  one  the  signal 
is  unintentionally  perturbed  by  noise,  and  in  the  other, 
namely  the  cryptographic  system,  the  message  is  inten- 
tionally perturbed  by  the  ciphering  process  to  hide  the 
information.  Thus,  there  is  an  uncertainty  of  what  was 
actually  transmitted.  From  information  theory  a natural 
mathematical  measure  of  uncertainty  is  the  conditional 
entropy  of  the  transmitted  signal  when  the  received  signal 
is  known.  This  conditional  entropy  is  known  as  equivocation. 

HCX/Y)  = - Z p(x,y)  log  p(x/y) 

From  the  point  of  view  of  tl  > cryptanalyst,  a secrecy 
system  is  almost  identical  wi  .h  a noisy  communication 
system.  The  message  is  operated  by  a statistical  element, 
the  enciphering  system,  with  its  statistically  chosen  key. 

The  result  of  this  operation  is  the  cryptogram,  which  when 
transmitted  is  vulnerable  to  interception  and  available  for 
analysis.  The  main  differences  in  the  two  cases  are: 

1.  The  operation  of  the  enciphering  transformation 
is  generally  of  a more  complex  nature  than  the  perturbing 
noise  in  a channel. 

2.  The  key  for  a secrecy  system  is  usually  chosen 
from  a finite  set  of  possibilities  while  the  noise  in  the 
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channel  is  more  often  continually  introduced,  in  effect 
chosen  from  an  infinite  set. 

With  these  considerations  in  mind  it  is  natural  to  use 
the  equivocation  as  a -theoretical  secrecy  index.  It  may 
be  noted  that  there  are  two  significant  equivocations, 
that  of  the  key  and  that  of  the  message  which  are  denoted 
as  HfK/C)  and  H(P/C) : 

H (K/C)  = -£p(C,K)  log  p(K/C) 

H (P/C)  = - £ p(C,P)  log  p(K/P) 

The  same  general  arguments  used  to  justify  the  equivo- 
cation as  a measure  of  uncertainty  in  communication  theory 
apply  here  as  well.  Zero  equivocation  requires  that  one 
message  (or  key)  have  unit  probability  and  all  others  zero, 
corresponding  to  complete  knowledge. 

E.  IDEAL  SECRECY  SYSTEMS 

In  Reference  15,  the  concept  of  equivocation  leads  to 
means  of  evaluating  secrecy  systems  as  a function  of  the 
amount  of  N,  the  number  of  letters  received.  It  is  shown 
that  for  most  systems  as  N increases  the  referred  equivo- 
cations tend  to  decrease  to  zero,  consequently  the  solution 
of  the  cryptogram  becomes  unique  at  a point  called  unicitv 
point. 

In  the  section  on  Perfect  Secrecy  it  was  stated  that 
perfect  secrecy  requires  an  infinite  amount  of  key  if 
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messages  of  unlimited  length  are  allowed.  With  a finite 
key  size,  the  equivocation  of  key  and  message  generally 
approaches  zero.  The  other  extreme  is  for  H(K/C)  to  be 
equal  to  H(K) . Then,  no  matter  how  much  material  is 
intercepted,  there  is  not  a unique  solution  but  many  of 
comparable  probability.  An  ideal  system  can  be  defined  as 
one  in  which  H(K/C)  and  H(P/C)  do  not  approach  zero  as 
N increases.  A strongly  ideal  system  would  be  one  in  which 
H(K/C)  remains  constant  at  H(K) , that  is,  knowing  the  crypto 
gram  has  not  aided  in  solving  the  key  uncertainty. 

An  example  of  an  ideal  cipher  is  a simple  substitution 
in  an  artificial  language  in  which  all  letters  are  equi- 
probable  and  successive  letters  independently  chosen. 

With  natural  languages  it  is  in  general  possible  to 
approximate  the  ideal  characteristic.  The  complexity  of 
the  system  needed  usually  goes  up  rapidly  when  an  attempt 
is  made  to  realize  this.  To  approximate  the  ideal  equivo- 
cation, one  may  first  operate  on  che  message  with  a trans- 
ducer which  removes  all  redundancies.  After  this  almost 
any  simple  ciphering  system  - substitution,  transposition, 
etc.,  is  satisfactory.  The  more  elaborate  the  transducer 
and  the  nearer  the  output  is  to  the  desired  form,  the 
more  closely  will  the  secrecy  system  approximate  the  ideal 
characteristic . 

The  work  to  be  presented  in  following  sections,  will 
describe  a scheme  to  approximate  the  ideal  secrecy  system 
by  using  a digital  computer  to  mainly  accomplish  two  things: 
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1.  Change  the  probability  structure  of  natural  languages 
to  obtain  an  almost  equiprobable  occurrence  of  letters. 

2.  Eliminate  the  statistical  dependence  of  successive 
letters  in  natural  languages. 

Further/  a message  transformed  tc  reflect  these 
properties,  will  be  either  transmitted  as  such  or  an  addi- 
tional conventional  ciphering  can  be  made. 
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V.  DIGITAL  SUBSTITUTION 


The  development  of  a digital  substitution  cipher  was 
the  first  step  taken  to  accomplish  the  present  work. 

After  it#  more  complex  variations  were  experimented  to 
obtain  a reasonable  secure  system  taking  advantage  of  the 
use  of  the  computer.  Thus#  it  can  be  said  that  most  of  the 
subsequent  work  rests  on  these  first  results.  A brief 
explanation  follows  of  the  Decwriter  system  and  its  character 
codes  used  to  interface  with  the  PDP-11/40  computer. 

A.  THE  DECWRITER  SYSTEM 

The  LC11  Decwriter  system  is  a high-speed  teletype- 
writer designed  to  interface  with  the  PDP-11  family  of 
processors  to  provide  both:  Input  (keyboard)  and  output 

(printer)  functions  for  the  system.  It  can  be  used  as  the 
console  input/output  device.  The  system  can  receive 
characters  from  the  keyboard  or  can  print  at  speeds  up  to 
30  characters  per  second  in  standard  ASCII  formats.  The 
character  code  used  is  USASCII-68  which  is  listed  in  Table 
No.  I.  From  these  128  characters,  only  64  are  printing 
characters#  those  of  columns  2,  3,  4 and  5.  Table  No.  II 
presents  these  64  characters  and  their  correspondent 
binary  representation. 


30 


SP 

10100000 

0 

10110000 

@ 

11000000 

p 

f 

10100001 

1 

10110001 

A 

11000001 

Q 

If 

10100010 

2 

10110010 

B 

11000010 

R 

# 

10100011 

3 

10110011 

c 

llOOQGil 

s 

$ 

10100100 

4 

10110100 

D 

11000100 

T 

% 

10100101 

5 

10110101 

E 

11000101 

u 

Si 

10100110 

6 

10110110 

F 

11QQ011Q 

V 

f 

10100111 

7 

10110111 

G 

11000111 

w 

( 

10101000 

8 

10111000 

H 

11001000 

X 

\ 

10101001 

9 

10111001 

I 

11001001 

Y 

* 

10101010 

: 

10111010 

J 

11001010 

z 

+ 

10101011 

? 

10111011 

K 

11001011 

[ 

t 

10101100 

< 

10111100 

1 

11001100 

1 

- 

10101101 

= 

10111101 

M 

11001101 

] 

• 

10101110 

> 

10111110 

N 

11001110 

A 

/ 

10101111 

7 

10111111 

0 

11001111 

TABLE  II  - DECWRITER  PRINTING  CHARACTERS 
THEIR  BINARY  REPRESENTATION 
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11010000 

11010001 

11010010 

11010011 

11010100 

11010101 

11Q1Q110 

11010111 

11010000 

11011001 

11011Q1C 

11011011 

11011100 

110U101 

11011110 

11011111 

AND 


B.  APPLICATION  OF  GROUP  THEORY  TO  CRYPTOGRAPHY 

A group  is  defined  as  a set  of  elements  a,  b,  c,  ... 
and  an  operation,  denoted  by  + for  which  the  following 
properties  are  satisfied: 

a)  For  any  elements  a,b,  in  the  set,  a + b is  in  the 

set. 

b)  The  associative  law  is  satisfied;  that  is,  for 
any  a,b,c  in  the  set 

a + (b  + c)  = (a  + b)  + c 

c)  There  is  an  identity  element,  I,  in  the  set  such 

that 

a+I=I+a=a;  ail  a in  the  set. 

d)  For  each  element  a,  there  is  an  inverse  a * in 
the  set  satisfying 

a + a^  = a^  + a = I 

A group  is  abelian  or  commutative  if 

a + b = b + a for  all  a and  b in  the  set. 

The  integers  under  ordinary  addition  and  the  set  of 
binary  sequences  of  a fixed  length  n under  exclusive-or 
operation  are  examples  of  abelian  groups. 


i 

j 
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From  boolean  algebra,  an  additional  property  of  an 
abelian  group  of  binary  sequences  of  a fixed  length  n 


under  the  exclusive-or 

operation  is 

that. 

given 

a + b = c 

then 

a + c = b 

and 

b + c * a; 

for  all  a,b  and  c in 

the  group. 


The  8-bit  binary  sequences  with  which  the  computer 
handles  the  ASCII  code  characters  is  in  this  sense  an 
abelian  group.  This  last  property  suggested  the  idea  of 
encrypting  simply  by  exclusive-oring  the  desired  set  of 
sequences  by  a key  (another  sequence  or  a set  of  sequences) . 
Decrypting  or  recovery  of  the  original  sequences  can  be 
done  simply  by  exclusive-oring  the  obtained  set  of  sequences 
with  the  key. 

Basically  the  transformation  can  be  expressed  as 

C = K + P , for  encryption,  and 

P * K + C , for  decryption, 

where  C,  K and  P represent  an  8-bit  sequence  stored  in  a 
register  and  the  symbol  + stands  for  the  logical  exclusive- 
or  operation. 

g 

While  it  is  clear  that  the  whole  2 8-bit  sequences 
can  be  used  to  represent  crypto  sequences,  since  this  set 
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of  sequences  constitute  an  abelian  group;  a limitation  was 
imposed  through  this  work  to  allow  trans formations  to  be 
done  between  printing  characters  (those  of  Table  II) . 

That  is,  restrict  the  domain  and  range  of  the  transforma- 
tions to  the  binary  sequences  of  Table  II. 

We  can  further  realize  the  12  possible  combinations 
of  two  sequences  of  same  or  different  sets  by  exclusive- 
oring  them  and  observe  that  the  range  of  the  transformations 
is  given  by  the  sets  of  sequences  whose  4-left  most  are: 

0000  for  A + A 

B + B 
C + C 
D + D 

0 0 0 1 for  A + 3 

B + i 
C + D 
D + C 

0110  for  A + C 

C + A 
B + D 
D + B 

0111  for  A + D 

D + A 
B + C 
C + B 
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C.  TRANSFORMATIONS 

From  Table  II  it  can  be  observed  that  these  sequences 
no  longer  form  a group  under  the  exclusive-or  operation# 
since  choosing  any  two  sequences  will  originate  a new 
sequence  not  in  the  referred  table.  For  example: 

Plaintext  character  = A=  11000001  + 

Key  character  = L = 11001100 

Ciphered  character  = 00001101 

And  we  obtained  a sequence  00001101  not  in  the  table. 

If  we  observe  sets  A,  B,  C and  D of  Table  II,  we  will 
observe  that  each  set  has  its  4-left  most  bitrs  equal.  Or 
that  the  dom  ain  of  the  transformation  is  given  by  the 
sequences  who^e  4-left  most  bits  are: 


Set 

A 

10  10 

Set 

B 

1 Q 1 1 

Set 

C 

110  0 

Set 

D 

110  1 

In  order  to  make  the  range  of  the  transformations  equal 
to  its  domain  in  accordance  with  the  restriction  imposed, 
an  additional  binary  multiplier:  The  intermediate  key  (IK) 

was  devised.  It  allowed  for  mapping  into  the  64  printing 
characters . 
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Tho  value  of  IK  is  dependent  on  the  particular  transfor- 
mation desired  and  the  key  to  be  used.  For  example: 

A system  is  designed  to  transform  characters  from  set  B 
into  characters  of  set  C for  encryption.  The  decryption 
is  done  by  doing  the  inverse.  Now  assume  that  the  key  to 
be  used  for  a particular  transformation  belongs  to  set  D. 

Plaintext  character  = 8 = 10111000  (Set  B) 

Key  character  = Z = 11011010  (Set  D) 

01100010 
IK  = 10100000 

Crypto  character  = B = 11000010  (Set  C) 

The  intermediate  key  value  was  obtained  by  exclusive- 
oring  the  4-left  most  bits  of  the  plaintext,  the  key  and 
the  crypto  characters.,  as  shown  below. 


Plaintext 

character 

1011  + 

Key 

character 

1101  + 

Crypt< 

character 

1100 

IK 

10100000 

For  decrypting  the  inverse  is  done,  that  is: 
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'W  j*  iinwi'  7J.^11,*';  «j 


Crypto  character  = B = 11000010  (Set  C) 
Key  character  = Z = 11011010  (Set  D) 

00011000 

IK  = 1010Q000 

Plaintext  character  = 8 = 10111000 


Based  on  the  concepts  so  far  presented  and  the  idea 
of  the  intermediate  key  multiplier,  that  allows  for  sequences 
of  Table  II  to  behave  like  a group,  Table  III  was  con- 
structed. It  gives  the  necessary  values  of  IK  for  all 
possible  transformations  in  between  sets.  From  this  general 
table,  it  can  be  obtained  typical  tables  of  required  values 
of  IK  for  each  specific  transformation.  For  example,  if 
we  assume  that  the  desired  transformation  between  the  four 
sets  were 


encryption 

/decryption^ 

A-  C 

^decryption  / 
encryption 


/ 


encryption 

decryption 


B 


D 


decryption  / 
encryption 


Then  the  required  table  of  IK  values  will  be: 
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m Cfcfctoa 


KEY  SET 

A 

B 

C 

D 

PLAINTEXT 

SET 

A 

C 

D 

A 

B 

B 

C 

D 

A 

B 

C 

C 

D 

A 

B 

D 

C 

D 

A 

B 

D.  SIMPLE  SUBSTITUTION 

Although  the  scheme  develop ia  and  presented  until  now 
provides  for  transformations  using  the  64  printing  charac- 
ters, a restriction  was  placed  to  be  able  to  handle  only 
the  26  letters  of  the  English  alphabet  plus  the  additional 
6 characters  that  appear  in  Table  No.  II,  sets  C and  D. 
Thus,  for  the  simple  substitution  ciphers  transformations 
were  designed  between  these  two  sets , that  is , 


Encryption 
! Decryption^ 


Decryption  / 
Encryption 


And  the  corresponding  table  of  values  of  intermediate  keys 
will  be: 
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Figure  2 shows  in  block  diagram  the  computer  realization 
of  this  simple  substitution  cipher.  Appendix  A gives  the 
complete  program  to  accomplish  this.  Figure  3 is  an 
example  of  this  cipher. 


E.  GRAPHICAL  REPRESENTATION  OF  RESULTS 

Natural  languages,  such  as  English,  Spanish,  German, 
French,  etc.,  have  a characteristic  letter  frequency.  For 
example,  the  normal  frequency  for  English  is  as  shown  in 
Table  IV. 

For  the  purpose  of  observing  the  statistical  nature 
of  plaintexts  as  well  as  of  cryptograms  obtained,  a computer 
program  (shown  in  Appendix  B and  C)  was  made  to  realize 
the  following  computations: 

- Count  the  . lumber  of  occurrences  of  each  latter  in 
a text. 

- Calculate  and  plot  the  percentage  of  occurrence  of 
each  character  in  the  text. 


- Calculate  the  mean  value  of  percentage  of  occurrences. 

- Calculate  the  standard  deviation  of  the  percentage  of 
occurrences . 
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Input  key 
character 
to  RO 


Identify  key 
set  and  assign 
identifying 
nvirfcer  to  it 
(R3)=l  SET  A 
(R3)  =3  SET  B 
(R3)=5  SET  C 
I fB31=2  SEC  D 


Figure  2.  Block  diagram  of  the  program  for  the 
simple  substitution  cipher 
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Alphabetically  By  frequency 


A 

- 

7.3% 

E 

- 

13.0% 

B 

- 

0.9 

T 

- 

9.3 

C 

- 

3.0 

N 

- 

7.8 

D 

- 

4.4 

R 

- 

7.7 

E 

- 

13.0 

I 

- 

7.4 

F 

- 

2.8 

0 

- 

7.4 

G 

* 

1.6 

A 

- 

7.3 

H 

- 

3.5 

S 

- 

6.3 

I 

- 

7.4 

D 

- 

4.4 

J 

- 

7.2 

H 

- 

3.5 

*•* 

*% 

o.3 

L 

- 

3.5 

L 

- 

3.5 

C 

- 

3.0 

M 

- 

2.5 

F 

- 

2.8 

N 

- 

7.8 

P 

- 

2.7 

0 

- 

7.4 

U 

- 

2.7 

? 

- 

2.7 

M 

- 

2.5 

Q 

- 

0.3 

Y 

- 

1.9 

R 

- 

7.7 

G 

- 

1.6 

S 

- 

6.3 

W 

- 

1.6 

T 

- 

9.3 

V 

- 

1.3 

U 

mm 

2.7 

B 

- 

0.9 

V 

- 

1.3 

X 

- 

0.5 

W 

- 

1.6 

K 

- 

0.3 

X 

- 

0.5 

Q 

- 

0.3 

Y 

- 

1.9 

J 

- 

0.2 

Z 

- 

0.1 

Z 

- 

0.1 

TABLE  IV  - FREQUENCY  OF  THE  LETTERS  OF  THE  ENGLISH 
ALPHABET , ARRANGED  ALPHABETICALLY  AND  BY 
FREQUENCY 
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For  each  transformation  done,  the  text  was  analyzed 
by  this  program  and  the  results  were  plotted.  In  the 
horizontal  axis  are  the  32  chosen  characters  in  the 
following  order  from  zero  to  31: 

0ABCDEFGHIJKLMNOPQRSTUVWXY  Z [/]  A_ 

In  the  vertical  axis  the  percentage  of  occurrence  scale 
or  frequency  distribution  is  plotted. 

Examples  of  these  plots  are  given  by  Figures  5 to  8. 
There  the  frequency  distribution  of  letters  for  the 
following  languages  is  plotted: 

Figure  4:  ENGLISH 

Figure  5:  SPANISH 

Figure  6:  FRENCH 

Figure  7:  ITALIAN 

The  author  has  preferred  to  give  the  results  achieved 

through  this  work  by  presenting  these  plots  rather  than 
giving  messages  and  their  cryptograms  as  examples  of  what 
was  obtained.  Inherent  with  these  plots  is  an  evaluation 
of  the  system  used  in  each  case.  Additional  information 
that  will  be  found  in  these  plots  is  the  standard  deviation 
of  percentage  of  occurrence  of  the  character  in  each 
cryptogram. 
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For  the  simple  substitution  cipher,  it  was  expected 
to  obtain  similar  results  as  for  the  plaintext  of  Figure  5. 
Figures  8 to  10  show  the  frequency  distribution  of  characters 
when  this  system  was  used  with  different  keys.  As  expected, 
similar  results  were  obtained  but  with  the  values  changed 
from  one  character  to  another.  This  occurred  since  one 
character  or  letter  has  just  been  replaced  by  another 
through  these  transformations.  Table  V presenting  in 
tabular  form  the  number  of  occurrences  for  these  substitu- 
tions gives  a figure  of  what  has  occurred  with  the  messages 
in  each  case. 

In  Section  IV,  Theory  of  Secrecy  Systems,  it  was  stated 
that  one  goal  to  achieve  ideal  secrecy  was  to  change  the 
probability  structure  of  natural  languages  to  obtain  an 
equiprobable  occurrence  of  letters.  This  is  the  reason  why 
the  calculation  of  standard  deviation  was  considered  to 
evaluate  secrecy  obtained.  Since  the  language  to  be  used 
in  this  present  work  will  be  English  it  may  be  useful  to 
keep  in  mind  that  the  standard  deviation  for  an  English 
text  is  3.81  as  stated  in  Figure  4. 

F.  PSEUDORANDOM  SUBSTITUTION 

The  simple  substitution  cipher  can  also  be  called 
monoalphabetic  cipher  since  there  is  only  one  alphabet 
to  encipher  the  message.  The  cryptanalytic  weakness  of 
this  cipher  is  the  fact  that  a given  plain  language  letter 
is  always  represented  by  the  same  crypto  letter. 
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NUMBER  OP  OCCURRENCES 


KEY 


Character* 

§ 

A 

C 

s 

K 

N 

@ 

24 

3 

77 

7 

0 

0 

A 

3 

24 

94 

12 

0 

248 

B 

94 

77 

3 

37 

24 

0 

C 

77 

94 

24 

128 

4 

0 

D 

128 

37 

7 

77 

248 

0 

E 

37 

128 

12 

94 

0 

0 

F 

12 

7 

37 

3 

0 

4 

G 

7 

12 

128 

24 

0 

24 

H 

4 

24 

0 

248 

77 

12 

I 

24 

4 

0 

0 

94 

7 

J 

0 

0 

24 

0 

3 

128 

K 

0 

0 

4 

0 

24 

37 

L 

0 

0 

248 

0 

7 

94 

M 

0 

0 

0 

0 

12 

77 

M 

0 

248 

0 

24 

37 

24 

0 

248 

0 

0 

4 

128 

3 

P 

11 

105 

27 

12 

3 

68 

Q 

105 

11 

27 

32 

3 

93 

R 

27 

27 

10.5 

160 

76 

33 

S 

37 

27 

11 

33 

63 

48 

T 

33 

160 

12 

27 

93 

3 

U 

160 

33 

32 

27 

68 

3 

V 

32 

12 

160 

105 

48 

63 

W 

12 

32 

33 

11 

33 

76 

X 

63 

76 

3 

93 

27 

32 

Y 

76 

63 

3 

68 

27 

12 

Z 

3 

3 

76 

48 

105 

33 

C 

3 

3 

63 

33 

11 

160 

/ 

33 

48 

93 

3 

12 

27 

] 

48 

33 

68 

3 

32 

27 

Ss 

68 

93 

48 

76 

160 

11 

93 

68 

33 

63 

33 

105 

Table  No.  V . - Simple  substitution  cipher 
Table  of  number  of  occurren 
ces . 
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In  this  section,  a digital  polyalphabetic  substitution 
very  much  alike  to  the  Vigenere  square,  cited  by  Sinkov 
[Ref.  17],  is  designed.  The  originality  of  the  scheme 
presented  here  is  the  fact  that  the  different  alphabets 
are  used  in  a pseudorandom  way  and  that  this  is  generated 
through  a simple  algorithm  in  the  computer. 

The  basis  for  the  program  to  realize  this  cipher  is 
provided  by  the  same  algorithm  as  for  the  simple  substitution 
case,  the  only  variation  being  that  the  key  will  change  for 
each  character  to  be  ciphered.  These  changes  of  key  are 
controlled  by  a program  and  thus  the  inverse  transformation 
can  be  made  to  decipher  by  using  the  same  program.  This 
fact  that  we  are  using  a different  key  each  time  is  the 
same  as  using  a new  substitution  alphabet  for  each  character. 

It  must  be  set  clear  here  that  the  key  used  was  a single 
letter  and  not  a number  of  letters  equal  to  the  message 
length.  This  single  letter  was  used  to  initialize  a register 
used  as  a counter.  For  each  new  letter  of  the  message 
the  register  contents  were  increased  by  one  each  time  until 
a specific  number  was  reached,  in  which  case  the  register 
was  reset  to  zero.  This  specific  number  is  the  desired 
number  of  alphabets  to  be  used.  Figure  11  gives  a graphical 
idea  of  how  this  was  accomplished.  In  the  figure,  N 
represents  the  total  number  of  alphabets  to  be  used;  it 
ranges  from  one,  for  a simple  substitution,  to  32  when  using 
all  the  possible  alphabets. 


Figure  11.  Psuedorandom  cipher  block  diagram 
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The  result  expected  for  this  cipher  was  the  origination 
of  an  artificial  language  with  32  possible  characters  and 
with  a letter  frequency  different  than  that  of  the  plaintext 
message  in  natural  English  language. 

To  observe  the  results  of  this  cipher  two  sets  of 
transformations  were  made: 

1.  Using  15  alphabets  and  six  different  keys. 

The  keys  used  were: 

a)  e 

b)  A 

c)  C 

d)  G 

e)  K 

f)  N 

2.  Using  a single  key  and  different  number  of 
alphabets,  in  the  following  order: 

a)  7 alphabets;  key  R 

b)  15  alphabets;  key  R 

c)  23  alphabets;  key  R 

d)  31  alphabets;  key  R 

Figures  12  and  13  show  some  results  obtained  for  the 
first  set  of  transformations  as  a plot  of  percentage  of 
occurrence  of  the  32  different  characters.  As  can  be 
observed,  for  the  six  cases,  all  the  characters  have  a 
certain  number  of  occurrences  in  the  cryptogram  obtained, 
thus  giving  rise  to  an  artificial  language  of  32  characters 
witha  quite  different  letter  frequency  than  the  plaintext 
of  Figure  4. 
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In  the  same  way.  Figures  14  and  15  show  some  results 
obtained  for  the  second  set  of  transformations,  which  are 
essentially  the  same  as  the  first  set. 

A measure  of  how  different  these  results  are  from  the 
plaintext  is  provided  by  the  standard  deviations  in  each 
case  and  sure  here  listed  to  provide  a means  of  evaluating 
the  results  achieved: 

Number  of  alphabets  Key  Std.  Deviation 


15 

@ 

1.528 

15 

A 

1.528 

15 

C 

1.528 

15 

G 

1.528 

15 

K 

1.528 

15 

N 

1.528 

7 

R 

1.467 

15 

R 

1.545 

23 

R 

1.407 

31 

R 

1.329 

These  standard  deviation  values  compared  with  the  3.81  for 
the  plaintext,  represent  a significant  flattening  of  the 
percentage  of  occurrence  plots,  or  in  other  words,  the 
cryptogram  has  a more  equiprobable  letter  frequency. 

A significant  property  of  this  scheme  if  we  envision  it 
as  part  of  a digital  communication  system,  is  the  fact  that 
it  offers  no  error  propagation  during  the  message  processing. 
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The  reason  for  this  is  the  fact  that  each  character  is 
operated  upon  independently  from  all  others.  Thus,  if 
there  is  an  error  in  the  bit  representation  of  a letter, 
there  will  be  an  error  in  its  transformation  to  crypto 
character  or  in  the  decryption  of  it  and  no  error  will 
occur  in  other  characters  due  to  it. 

In  the  next  section,  a cryptographic  scheme  will  be 
presented  that  although  contributing  to  the  communication 
system  degradation,  gives  better  results  in  the  sense  that 
a nearly  equiprobable  artificial  language  is  achieved 
which  represents  a significant  achievement  for  security  of 
data  transmission  and/or  data  storage. 
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VI.  THE  DATA-KEYED  CIPHER 

A.  INTRODUCTION 

In  this  section  the  data-keyed  cipher  is  presented. 

First/  a very  general  description  of  the  system  is  given. 

Then  the  transfer  function  concept  of  the  cipher  and  the 
reversibility  and  consistency  of  its  is  explained/  together 
with  the  equated  logical  form  of  the  transformation  which 
the  author  appreciates  as  being  a very  meaningful  representa- 
tion of  the  cipher  in  logical  form.  After  that  the  computer 
realization  is  presented  in  block  diagram  form.  The  test 
procedure  for  valuating  secrecy  accomplished  and  significant 
results  are  then  given.  Finally,  the  communication  system 
degradation  due  to  it  is  analyzed. 

B.  DESCRIPTION  AND  REALIZATION 

Section  IV  explains  how  the  PDP-11/40  computer  is 
handled  to  realize  the  simple  substitution  cipher,  con- 
sistency was  shown  with  some  examples  and  further,  the 
known  cryptoanalytic  weakness  of  it  was  explained  and 
graphically  represented  by  Fig.  4 where  it  can  be  observed 
the  frequency  distribution  of  the  plaintext  and  of  some 
cryptograms  and  their  similarity  can  be  established. 

The  data-keyed  cipher  can  be  explained  in  a general 
form  as  the  scrambling  of  the  bits  of  a character  by 
operating  on  them  by  past  characters,  either  of  the  plain- 
text, when  ciphering,  or  of  the  cryptogram,  when  deciphering. 


Provided  these  past  characters  are  far  enough  apart  in 
the  sequence  their  operation  on  the  character  to  be  trans- 
formed will  result  in  a nearly  random  trams formation. 

This  idea  was  supported  by  the  fact  that  for  far  enough 
distance  between  two  letters  in  a written  language  there 
is  nearly  no  statistical  dependence  between  them. 

Figure  16  provides  the  conceptual  idea  of  this  cipher. 
At  this  point,  two  significant  characteristics  that 
distinguish  this  cipher  are  to  be  emphasized: 

1.  From  Figure  16(a)  and  (b)  it  can  be  seen  that  both 
diagrams  can  be  conceived  as  c.  transfer  function  that 
essentially  perform  similar  transformations  on  their  inputs. 
An  advantage  is  that  when  this  is  realized  in  the  computer 
by  a program,  the  same  program  will  execute  both  trans- 
formations; that  of  ciphering  and  deciphering. 

2.  From  Figure  16(b)  it  can  be  observed  that  there  is 
no  feedback  present,  that  is,  the  outputs  are  not  dependent 
on  past  outputs.  The  significance  of  this  fact  will  be 
considered  at  the  end  of  this  section  when  system  degrada- 
tion for  this  cipher  is  treated. 

The  realization  of  this  ciphering  scheme  again  uses  the 
basic  transformations  presonted  in  Section  IV,  plus  addi- 
tional steps  are  included  to  accomplish  the  data-keyed 
function.  The  conceptual  idea  given  in  Figure  16  can  now 
be  expressed  in  logical  equated  form  as: 
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b)  Deciphering  j 

Figure  16.  Data-Keyed  Cipher-Concept  | 

1 


CIPHERING:  Cj=(K  + Cj^)  + 

DECIPHERING:  Pj  = (K  + Cj^)  + C.. 

where 

Pj  = present  plaintext  character 

Cj  = present  crypto  character 

C s "iM  times  prc  jeding  crypto  character 
K = key  character 

Again  the  operator  used  is  the  Exclusive-Or . These 

logical  equations  show  the  reversibility  of  the  trans- 
formation and  thus  its  consistency. 

Figure  17  is  now  presented  to  give  a more  significant 
representation  of  the  transformation  to  be  realized.  The 
index  "i"  is  selective  and  it  represents  the  distance 
between  characters  already  explained. 

Figure  18  shows  the  block  diagram  of  the  realization 
of  this  cipher  in  the  Pr?-ll/40. 

Appendix  D gives  the  complete  listing  of  the  program 
used. 

C.  TEST  PROCEDURE 

The  plaintext  message  used  to  test  the  results  of  this 
cipher  scheme  was  the  one  presented  in  Section  IV  with  its 
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statistics  representative  of  the  English  language  as  shown 
in  Figure  4. 


This  cipher,  as  depicted  by  Figure  17,  has  two  possible 
choices  of  variables,  namely: 

- The  key,  with  a total  of  32. 

- The  delay  factor  "i"  which  could  be  varied  from 
zero,  for  a simple  substitution;  up  to  any  number  n. 
However,  for  any  choice  of  n there  will  be  the  same  amount 
of  simple  substitution  characters  at  the  beginning  of  the 
cryptogram.  This  disadvantage  can  be  avoided  by  using  for 
the  first  letters  of  the  plaintext,  meaningless  text. 

As  for  the  simple  substitution  case,  the  intermediate 
keys  were  selected  to  reflect  the  transformations  between 
sets  C and  D of  Table  II. 

To  observe  the  results  obtained  with  this  cipher  two 
sets  of  transformations  were  made: 

1.  Using  a fixed  value  of  "i"  and  six  different  keys. 
For  i = 7 and  the  keys: 

a) 

b) 

c) 

d) 

e) 

f) 


2.  For  a fixed  key  and  the  following  values  of  "i" 


J) : 

a) 

i = 

2 

b) 

i = 

3 

c) 

i = 

10 

d) 

• 

1 a 

13 

e) 

• 

1 = 

17 

f) 

i = 

20 

D.  RESULTS 

The  results  obtained  for  this  cipher  were,  in  all 
cases,  significantly  better  than  the  Pseudorandom  cipher 
of  the  previous  section  in  the  sense  that  the  standard 
deviations  were  much  lower,  thus  obtaining  a nearly 
equiprobable  text  of  cryptograms. 

For  the  test  procedure  established,  the  following  were 
the  specific  results  obtained: 

1.  For  a fixed  value  of  "ir  and  using  6 out  of  32 
possible  keys  the  following  were  the  values  of  standard 
deviation  obtained: 


Key 


A 

C 

G 

K 

N 


"i"  Standard  deviation 


7 

7 

7 

7 

7 

7 


0.5783 

0.6301 

0.5395 

0.5651 

0.5608 

0.6015 
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Figures  22  and  23  are  some  example  plots  for 
these  cases.  These  figures  are  shown  at  the  end  of  this 
section . 

2.  For  a fixed  key,  different  values  of  "i"  were 
tried.  The  values  of  standard  deviation  obtained  in  each 
case  were: 


Key 

•t 

Standard  deviation 

J 

2 

0.5761 

J 

3 

0.5344 

J 

10 

0.528 

J 

13 

0.5317 

J 

17 

0.4609 

J 

20 

0.501 

Figures  24  and  25  are  some  example  plots  for 
these  cases. and  are  presented  at  the  end  of  this  section. 

We  can  now  compare  these  results  with  the  statistics  of 
a plaintext  English  message  with  a standard  deviation  of 
3.81  (see  Figure  4).  A significant  flattening  of  the 
percentage  of  occurrence  plots  has  occurred.  In  addition 
the  statistical  dependence  of  occurrence  of  the  letter  in 
the  message  has  been  hidden.  The  reason  for  this  will  be 
explained  in  the  last  part  of  this  section  where  the  nature 
of  the  ciphering  scheme  is  explained  in  detail,  together 
with  the  inherent  degradation  to  a communication  system 
due  to  it. 
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In  Section  IV  it  was  stated,  from  Shannon  [Ref.  15], 
that  an  ideal  cipher  may  be  an  artificial  language  in  which 
all  letters  are  equiprobable  and  successive  letters 
occurring  independently.  This  is  nearly  the  case  for  this 
cipher.  Now  a simple  stabs titution,  such  as  the  one 
presented  in  Section  V,  can  be  performed  on  the  message 
without  making  it  easier  to  decipher. 

3.  A very  meaningful  characteristic  of  this  scheme 
was  the  fact  that  the  same  program  recovers  or  deciphers 
the  message.  Figures  19  and  20  present  two  examples  of  the 
encrypting  results  after  being  processed  by  the  program 
corresponding  to  this  cipher. 

To  give  an  idea  of  the  number  of  occurrences  of 
each  character  in  the  cryptograms  for  each  of  the  12  cases 
of  (1)  and  (2) , Tables  VI  and  VII  are  next  presented. 

4.  The  implementation  of  this  cipher  in  a digital 
computer  can  also  be  seen  as  the  implementation  of  a code 
where  the  transformations  are  dependent  on  a key  (a  letter 
or  character) , the  present  letter  to  be  encoded  and  some 
past  crypto  character. 

E.  COMMUNICATION  SYSTEM  DEGRADATION 

Due  to  the  nature  of  the  process  of  ciphering  and 
deciphering  of  this  system,  it  can  be  said  that  when  it 
comes  to  play  an  integral  part  of  a communication  system, 
it,  at  the  most,  will  double  the  probability  of  block  error. 
Here  the  block  length  has  been  8 bits  corresponding  to  a 


73 


CTHIS_B00K_IS_DES16MEB_  = RIHRRILV_F0R_USE_fiS_fl_FIR5T..VEfiR 
. GRRDUfiTE_TEKT_IN_I?JF0Rf1ftTI  0M_TH£0RY'_5UI  TRBLE_FOR_BOTH_E 
NGIMEERS_flND_HRTHEHflTICIflMS_<*'_I  T_IS_R5SUMED_THRT_7HE_PER 
PER.HRS-SQME_UMDERSTfiNDIMG_OF_FRESHKfcN_CfiLCUtUS-fiND.ELEri 
ENTRRY.PROBftBILITV_flMD_!M_THE_LfiTER_CWfiPTERS.SOKE_INT*:£> 
UCT0RY»RRND0M_PRGCE5S_THEGRV,_&_UNF0RTUNftTELV.THERE_I  S_OM 
E_MGRE>REQUIREMENT_THRT_I  S_HRPGER_TO_MEET.(?_f  HE-PERDER^M 
UST_HRVE_R_RERSOMfiBLE_LEVEL_OF_MflTHEMflT!CflL-MflTUF:I  TV 

a)  Plaintext  message  (input) 


CGC  2@LQ\\_GP@(s,V2LUS  3VW  J I M_  3_G  3UC  RZGNKK EUMXRBVGP-LHFNCG'RN 
8\G\TGPI V89ZQ8LSC L2W2M1 3YKPC M8JV_HftflK8l\HUP" 3LVPI IDPJWEV 
7 3C  _ S C! D F H _ V X 8 _ V R 5 V X 2 G' U V C GTHFTI M.M'  HJV I LJSX_FRR'‘MOOTPMCX- 
XYELVQHCHUC  ZJGUT‘'PKHXZR~SPGG  3FMOU8\  — C X-W-MPMKCUPC  EYR  3 3 
S-LMGHftNBS  3UVR~TEIRMC  UB"XEF\  31  3\OSG8  3P-OHUF8FIBSQ  * I \ 3FC.V 
HYKQGICGZT\ 3YE8FYTZMI I G 3PXKGEZK K 3 3HFBLVVJC XLMK 3_K 3DYNGPP 
3RZZ0QMCC  CLPNMSUVD8GUUDI\DLSMDS-HHKPKZUYJDXLLPSEKR2t  Z80C 
MBMGQRJ 3RPRFWP  JQSRPYGHJMI T_E 3 V I V C CTSSt NSC  PC 0 3VIEJC LE 

b)  Cryptogram  message  (output) 

Figure  19.  Data-Keyed  cipher 
Encrypting  process 


74 


| D7H I S-BC'C'K-I  S_D£SI  GNEI*_PR  I MRP I LV_FOR_USE_ AS_fi_F  I PST_VERR 

I 

1 

! . GRftDUftTE-TEKT.IN-lKFORMAT I 9M„THE9RV_SU3  TRRLE_FGR_BC'7H_  E 

NGINEEPS_ftND_MftTHEMftT I 01^*4 £._§•_  ! T_I5_ftSSU!1EC'„THfiT„THE_F:Fft 
DER_HflS-SGHE_lW0EPSTRMDING_0F_FRESHf1fif4.CRLCUl.US.ftND.ELEP 
fc  NTRRV_PRGBRBILITV_RMD_IM_THE_LRTER_CHfiPTERS_SGil1E_IMTR0D 
| UCTGRV-RftNDGM-PR0CESS_THE0RV_g»_UNF0RTUf4ATElV,_THEP£_IS..GN 

I E _NORE_REQUIREMEMT_THRT_I  5_HftPDER_T0_l1EET_Ct,..THE_PEADER_H 

i 

UST_HftVE_ft_REftS0E4flBLE_L£VEL-GF_HftTHEMRTICftL_MRTUE:n  V 

a)  Plaintext  message  (input) 


L'C-S  3GKVC  E _ G P @ 8 V Z L R T 2 Q P M U J _ 3_6  3 U C R 3VILLBRJXflBVGP-L0ftIDVUI 
GSGSTGPI V GG  3HGKTSLZW2  \ M 3 V L H \ P G M G X M ft  ft  K C"  L \ H R U VZK~  U N I Q R J W E V 
T2SXC  V C R 0 _ V X 8 _ V R 5 ~ _ 3 8 R C! \ VTHFTIM M V 0 M Q N K M T X _ F fl R ~ M OHS W J D _ X 
. VELVQHCHRS  3M8F:SV0KMXZR'SW8VZRPHR9\_-[  K_HXC  NWJLDRPC  EVR  3 3 
SXKP80FIES]UVR~TENFP\REY-EF\3I  3\OTeGZUXHOUF9ftBSG'NC  ZAD'XC! 
GVKQGI C6ZSE Z'BGRGTZHI I G 3P_L8B 3LLZ 3WFBLVYJ\_K ILZXL 3GVNGGP 
3F  3 3HV JD\CLPNWSUVCG8RRCNC  DL\HDS_MOLWL  3P' MDXLLG'SEK  U 3\ 3GH\ 
JBMGQR J 3RWFRPWHVE R P V 0 L U W I S X B Z ' N ~ \ C T 5 S E MSC VXHZ'NBME LE 

b)  Cryptogram  message  (output) 

Figure  20.  Data-Keyed  cipher 
Encrypting  process 
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>Lj\n  NKjX5|<|GH[n»OD023:t,<«c(H|3iO'qMOnw>® 


NUMBER  OF 

OCCURRENCES 

KEY 

( i = 7 ) 

Character 

e 

A C 

G K 

N 

36 

33 

40 

46 

45 

32 

35 

38 

37 

40 

40 

34 

35 

40 

33 

32 

32 

42 

55 

50 

51 

52 

53 

50 

47 

42 

56 

50 

42 

48 

46 

51 

52 

49 

46 

59 

47 

55 

41 

42 

44 

47 

50 

42 

41 

40 

46 

36 

41 

35 

48 

35 

43 

41 

38 

44 

44 

38 

41 

52 

34 

41 

28 

31 

29 

33 

47 

40 

40 

44 

38 

34 

44 

37 

34 

47 

48 

37 

42 

49 

39 

45 

45 

47 

29 

29 

32 

29 

29 

33 

32 

32 

42 

38 

37 

33 

51 

37 

47 

38 

36 

45 

43 

57 

48 

44 

48 

51 

50 

55 

45 

43 

61 

58 

58 

53 

62 

60 

61 

50 

53 

39- 

42 

51 

49 

46 

40 

54 

43 

47 

50 

41 

51 

51- 

48 

50 

52 

63 

38 

38 

49 

51 

5Q 

53 

59 

62 

45 

54 

56 

47 

64 

61 

53 

56 

53 

49. 

43 

37 

54 

40 

38 

50 

37 

43 

51 

51 

52 

36 

52 

40 

46 

37 

39 

43 

51 

63 

58 

55 

51 

60 

52 

52 

46 

60 

42 

58 

52 

52 

57 

57 

56 

44 

Table 

No.  VI 

. — Data- 

•keyed  < 

cipher 

Table  of  number  of 


occurrences . 
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>Lj\r-iNhO<s:<c:^cn(dO,tiOZI5t<«^HBO,flWOOW>® 


NUMBER  OF  OCCURRENCES 


« 

" i " VALUES 

( KEY  = J ) 

Character  2 

3 10 

13  17 

20 

37 

42 

40 

32 

42 

46 

41 

40 

36 

35 

41 

48 

48 

39 

49 

34 

36 

40 

44 

37 

38 

40 

29 

39 

34 

43 

47 

41 

41 

50 

43 

41 

46 

49 

50 

47 

47 

43 

35 

47 

42 

48 

45 

46 

48 

39 

40 

33 

48 

39 

44 

33 

48 

38 

38 

36 

34 

53 

45 

35 

32 

54 

36 

46 

42 

38 

52 

42 

40 

38 

41 

31 

41 

42 

37 

38 

40 

38 

37 

41 

34 

44 

36 

36 

45 

28 

52 

48 

35 

42 

26 

45 

42 

41 

50 

49 

44 

46 

49 

59 

51 

50 

36 

52 

58 

50 

48 

45 

61 

36 

46 

53 

47 

45 

46 

65 

37 

56 

48 

62 

60 

62 

43 

43 

52 

48 

49 

50 

47 

54 

56 

50 

54 

44 

45 

55 

40 

55 

46 

50 

62 

38 

50 

49 

43 

58 

53 

36 

46 

44 

49 

42 

51 

49 

49 

52 

44 

45 

41 

49 

57 

54 

44 

57 

53 

55 

49 

36 

60 

50 

48 

40 

39 

45 

54 

42 

62 

46 

55 

55 

52 

50 

55 

55 

53 

56 

52 

45 

44 

56 

54 

48 

Table 

No.  VII.- 

Data- 

■keyed  cipher 

Table  of  number  of 


occurrences . 
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byte.  Zt  must  be  emphasized  that,  although  for  ease  of 
computer  realization  the  8-bit  byte  was  used  to  represent 
a letter;  only  5 bits  could  have  been  enough  since  we  are 
using  only  32  letters  or  characters. 

This  increase  in  probability  of  error  can  be  said  to 
be  significant  but  with  the  availability  of  error  correcting 
codes  the  initial  probability  of  error  can  be  reduced  as 
desired  and  appropriately  so  that  doubling  it  when  using 
the  cryptosystem  will  not  be  that  significant.  Further, 
since  a computer  is  being  used  to  implement  it,  it  also 
can  be  used  to  realize  a suitable  error  correcting  scheme. 

In  the  next  section,  a suitable  error  correcting  scheme  is 
presented,  that  will  essentially  overcome  this  degradation. 

The  examples  that  follow  are  intended  to  explain  how 
the  probability  of  block  error  is  doubled  and  also  the 
existence  of  a transient  simple  substitution  for  the  first 
"i"  characters. 

Based  on  these  two  examples  the  following  observations 
can  be  made: 

1.  There  is  a transient  simple  substitution  for  the 
first  "i"  characters  when  enciphering.  This  is  the  case 
of  C^,  C2  and  C 3 from  Example  1. 

2.  After  the  transient  simple  substitution,  the  crypto 
characters  are  a result  of  a number  of  plaintext  characters. 
And,  the  higher  the  index  of  the  crypto  to  be  obtained,  the 
more  the  number  of  plaintext  characters  on  which  it  depends. 


78 


Ev»mple  No.  1 Enciphering  process 

Transformation:  Cj  *=  (K  + + Pj 

Plaintext  sequence:  pi'P2'P3'P4/P5'P6,P7/P8'P9 

Let  i = 3 
C,  * K + P, 


C2  = 


K + P, 


C3  = 


K + P. 


K + Cx  + P4  = 


K + (K  + P,)  + P, 


P1  + P4 


K + C2  + P5 


K + (K  + P2)  + P5 


P2  + P5 


C6  = 


K + C3  + P6 


K + C4  + P? 


K + (K  + P3)  + Pg 
K + (Px  + P4)  + P? 


P3  + P6 


C8  ' 


K + C5  + P8  “ 


K + (Pj  + Pj)  + Pg 


K + C6  + Pg  - 


K + (Pg  + Pg)  + Pg 


'10 


K + C,  + P1Q 


P1  + P4  + P7  + P10 


•u 


c,„  «■ 


•12 


K + c8  + P11 


K + Cg  + P12 


P2  + P5  + P8  + P11 

p3  + P6  + Pg  + P12 


’13 


K + ci0  + P13 


- K + Px  + P4  + P7  + P10  + P13 
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3.  The  order  of  dependency  observed  in  Example  1 is 
different  for  the  deciphering  case,  where  the  recovering 
of  the  text  is  just  dependent  on  two  crypto  characters. 
Thus,  one  error  in  the  crypto  sequence  will  just  give  rise 
to  two  errors  in  the  plaintext. 

Figure  21  gives  an  exeu  pie  of  the  transient  simple 
substitution  explained.  The  value  of  "i**  chosen  there 
is  50.  As  an  example  it  can  be  observed  here  that  fcr 
the  first  50  characters  of  the  plaintext  the  letter  R is 
always  substituted  by  the  letter  C. 


: \ 

RTHI5-IS-RN_EXRMPLE-0F_fl-CVCLIC-ERR0R_C0RRECTING-C0DE-ftP 
PLI  ED-TO-R.C  I PHERED. MESSAGE  — NOISE-GENERATED-!  N-fl-PROGR 
RW-I5-MODULO_TWO_ADDED-TO-THE-MESSftGE-TO_TEST-THE-EFFECT 
IVENESS-OF-THE-CODEB 

a)  Plaintext 


r 


Transient  substitution 




ft  E u X B N Yi BNP-NTIP'.A  3TN'  WNPNRHR  3XPNTC  C ' C NR^CCTPEX-VNP' 'JOGHC 
0 U Z Z U ft  K ZGPBSUEWZDZ  U 9 '■  \ 9 0 X T Z J H H 0 1 V 9 Z II V P 3 9 H - C Z fc  Sri  .v  M N F T \ F 
C'MD-BL-C  PCWZDBFPBPZ  3P.DHS0H-TDQ0B0Q  3C  MO VMHZK'MCf  .PBPVPP  2G 
T H W E H J " G*  N G V " C F:  ft  0 \ R 

b)  Cryptogram 


Figure  21.  Data- keyed  Cipher  - Example  of 
transient  substitution,  i = 50. 
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VII.  ERROR  CORRECTING  SCHEME 


The  data-keyed  cipher  of  the  last  section  offers  to 
the  system  a degradation  in  the  sense  that  the  probability 
of  word  error  is  doubled  due  to  the  nature  of  the  encipher- 
ment process,  as  was  explained.  This  increase  in  error 
will  undoubtedly  affect  the  legibility  of  any  message. 

Thus  it  was  necessary  to  look  into  error  correcting  codes 
that  will  eventually  overcome  this  present  disadvantage. 
Again  the  availability  of  the  digital  computer  proved  to 
be  very  useful  for  enciphering  the  message  and  to  encode 
it  for  transmission. 

The  error  correcting  code  developed  was  intended  for 
transmission  over  a memory less  binary  symmetric  channel. 

& memoryless  channel  is  the  one  on  which  noise  does  not 
depend  upon  previous  events.  A binary  symmetric  channel 
is  one  for  which  the  probability  of  a zero  to  be  changed 
to  a one,  is  equal  to  the  probability  of  a one  to  be 
changed  to  a zero,  during  transmission. 

Notation  that  will  encountered  through  this  section 


follows : 

k = 

Number  of  information  digits 

m = 

Number  of  check  bits 

n = 

Code  word  length  (n  = k + m) 

e = 

Maximum  number  of  correctible  bit  errors 
in  one  word 

R = 

Data  rate  (R  = k/n) 

I 


6 = Binary  symmetric  channel  parameter 

p(l/0)  * p(0/l) 

d * Hamming  distance  between  code  words. 


A.  BEST  CODE  DETERMINATION  , 

The  noise  channel  theorem  as  stated  by  Shannon  [Ref.  14] 
is: 

Let  a discrete  channel  have  the  capacity  C 
bits/sec.  and  a discrete  source  has  the 
entropy  per  second  H.  If  H < C there 
exists  a coding  scheme  such  that  the  output 
of  the  source  can  be  transmitted  over  the 
channel  with  an  arbitrarily  small  frequency 
of  errors.  If  H > C , it  is  possible  to 
encode  the  source  so  that  the  equivocation 
is  less  than  H - C + e , where  e is 
arbitrarily  small.  There  is  no  method  of 
encoding  that  gives  an  equivocation  less 
than  H - C . 

The  discrete  source  entropy  for  long  messages  consistin 
of  discrete  symbols  is  given  by 


n 

II (x)  ■ - E p.  log  p. 

i=l  1 1 


where  pi  is  the  probability  of  occurrence  of  a given  symbol. 
In  the  situation  where  the  symbols  are  transmitted  over  a 
noisy  channel  a given  symbol  x^  may  be  received  as  y^. 
Shannon's  measure  of  uncertainty  at  the  receiver  of  what 
was  actually  transmitted  is  defined  as: 


H (x/y)  = - I L p(xify.)  log  p(xi/yi) 

x y 


i 


88 


For  the  binary  symmetric  channel  this  uncertainty  is  given 
by: 


H(x/y)  = - (0  log  0 + (1-0)  log  (1-0)) 

Then  the  channel  capacity  is  given  by 

C = H (x)  - H (x/y)  maximized  for  H(x)  . 

A significant  parameter  commonly  used  is  the  probability 
of  word  error  in  the  message  instead  of  the  uncertainty 
measure.  The  probability  of  word  error  is  defined  as: 

P(e)  « Number  of  wrong  decoded  words 
' ' Number  of  words  in  message 

It  must  be  noted  at  this  point  that  there  will  not 
necessarily  be  a code  word  for  each  ASCII  character  used. 

In  fact  this  was  the  case  for  the  code  implemented,  where 
each  4 bits  of  the  message  sequence  is  encoded  into  a 
15-bit  word.  Thus,  each  8-bit  ASCII  character  was  encoded 
into  two  words  for  transmission. 

A "best  code"  means  one  that  has  least  probability  of 
error  for  any  give  channel  0 and  the  highest  rate  given  by 
the  ratio  of  information  bits  over  the  bit- length  of  each 
code  word.  The  error  correction  ability  of  the  code  can 
be  derived  from  the  Varsharmov-Gilbert-Sacks  condition 
(upper  bound) 
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2e-l 

Z 

i=0 


2 


m 


> 


which  is  a sufficient  but  not  necessary  condition.  And 
from  the  Hammings  lower  bound  inequality 


2m 


> 


which  is  a necessary  but  not  sufficient  condition  for 
designing  an  e-tuple  error  correcting  code. 

Conversely,  using  these  conditions,  once  a code  is 
chosen  and  specified  by  its  rate  (R)  and  code  word  length 
(n) , the  number  of  correctible  e- tuples  can  be  determined. 

The  theoretical  value  of  probability  of  error  is  given 
by  Ash  [Ref.  18]: 


p(e)  = 1 


e 

Z N.  81(l-0)n_1 
i=0  1 


where  is  the  number  of  correctible  e-tuple  errors,  and 
e^  = 0,1,2, ... , up  to  the  maximum  number  of  correctible 
errors  per  word. 

The  Hamming  distance  (d)  is  the  minimum  distance  between 
code  words.  If  d happens  to  be  even  and  the  maximum  value 
of  e is  given  by  (d-l)/2  , this  will  yield  a fraction. 


90 


i 

! 


a 

5 


Then  the  number  of  maximum  e-tuple  errors  is  given  by 
Shiva  [Ref.  191 

u(u  + 1) 

Number  of  correctible  d/2  errors  _ . ’z 

Total  number  of  d/2  errors  ~ 


d/2 


where 


y = 


d: 

(f)  i (§)  i 


For  the  same  channel  (8  constant) , reducing  the  probability 
of  error  results  in  a reduction  of  the  code  rate.  Working 
backwards,  for  any  given  probability  of  error  and  word 
length,  one  can  estimite  the  information  length  and  code 
rate  by  using  the  Varsharmov-Gilbert-Sacks  condition. 

In  the  present  work  a cyclic  code  with  a rate  R = 4/15 
is  implemented  to  overcome  the  degradation  due  to  the  noisy 
channel.  Its  effectiveness  was  tested  by  simulating  trans- 
mission over  a binary  symmetric  channel  with  different 
values  of  B. 


B.  THE  (15,4)  CYCLIC  CODE  AND  ITS  COMPUTER  REALIZATION 

The  theory  of  Cyclic  Codes  and  their  representation  by 
means  of  a k-stage  feedback  shift  register  is  very  well 
treated  by  Ash  [Ref.  18]. 


1.  Selection  of  Polynomial 


this  code  was  chosen  from  Appendix  C of  Peterson  [Ref.  20] , 
and  it  was 

G(x)  = x4  + x + 1 

which  is  an  irreducible  polynomial  and  which  can  be 
represented  by  a 4-stage  shift  register  as  shown  in  Figure 
26.  Since  G(x)  is  a maximum  period  irreducible  polynomial! 
with  a period  2 - 1 = 15  , it  divides  the  polynomial 
x15  + 1 (modulo  2).  Thus,  the  check  polynomial  for  this 
code  will  be 

H(x)  = = X11  + x8  + x7  + x5  + X3  + X2  + x + 1 

The  polynomial  cnosen  originates  a (15,4)  cyclic  code, 
that  is,  a code  where 

k = 4 
m = 11 
n = 15 

The  coefficients  of  the  check  polynomial  for  the  code 
word  00010011010111.  Since  the  code  is  cyclic,  any  cyclic 
shift  of  the  check  word  and  any  linear  combination  of  code 
words  is  another  code  word.  This  property  of  the  cyclic 
code  represents  an  advantage  for  decoding  purposes. 


2.  Computer  Realization  of  Encoder 

Encoding  in  a digital  computer  is  accomplished  by 
realizing  the  shift-register  operations  by  implementing  a 
matrix  multiplication  of  the  message  word  by  a generator 


The  generator  matrix  for  the  characteristic  poly- 
4 

nomial  G(x)  * x + x + 1 used,  was 


100010011010111 
rr1  010011010111100 
l°J4,15  “ 001001101011110 

000100110101111 

which  when  multiplied  by  the  message  word  [x]^  4 , yielded 
the  code  word  [w] ^ 15  . 

A further  comment  can  be  made  on  the  structure  of 
the  generator  matrix:  The  four  rows  are  code  words  and 

they  are  linearly  independent,  and,  any  of  the  other  code 
words  can  be  obtained  by  linear  combination  of  these  four 
rows.  For  ease  of  computer  implementation,  to  obtain  a code 
word  it  was  only  needed  to  exclusive-or  the  rows  of 
[G] i 15  where  a 1 occurs  in  the  message  word.  For 
example, 

[X] . .=1100  (message  word) 

9 * 

First  row  of  G =100010011010111  + 

Second  row  of  G = 0 10011010111100 
Code  word  110001001101011 


Appendix  E shows  the  complete  listing  of  this 
encoding  program. 

3.  Minimum  Distance  Decoder 

Table  VIII  gives  the  code  words  for  the  16  possible 
message  words  when  the  (15,4)  cyclic  code  is  used.  It  can 
be  observed  that  the  Hamming  distance  between  these  code 
words  is  8.  That  is,  the  number  of  different  digits  between 
code  words  is  8 (d  «*  8) . 

With  the  minimum  distance  decoder,  if  any  combination 
of  * or  less  errors  occur  in  a received  code  word,  it 
can  be  corrected  with  absolute  certainty.  For  this  code,  any 
3 or  less  errors  can  be  corrected  successfully. 

For  the  case  when  4-digit  errors  occur  (e  = 4) , the 
Varsharmov-Gilbert-Sacks  condition  (Upper  bound) 


is  not  satisfied  and  thus  there  exists  an  uncertainty  on 
whether  a 4-digit  error  will  be  corrected.  It  has  been 
found  experimentally  that  67.8%  of  different  combinations 
of  4-digit  errors  can  be  corrected.  Appendix  G shows  the 
complete  listing  of  the  decoding  program. 

C.  NOISY  CHANNEL  SIMULATION 

Table  IX  provides  the  expected  probabilities  of  error 
for  transmission  over  a noisy  binary  symmetric  channel  when 
using  the  (15,4)  cyclic  code  presented,  as  given  by 
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Information 

word 

0 0 0 0 

0 0 0 1 

0 0 10 

0 0 11 

0 10  0 

0 10  1 

0 110 

0 111 

10  0 0 

10  0 1 

10  10 

10  11 

110  0 

110  1 

1110 

1111 


Coded 

0 0 0 0 0 0 0 
0 0 0 1 0 0 1 
0 0 1 0 0 1 1 
0 0 110  10 
0 10  0 110 
0 10  1111 
0 110  10  1 
0 11110  0 
1 0 0 0 1 0 0 
10  0 110  1 
10  10  111 
10  11110 
1 1 0 0 0 1 0 
110  10  11 
1 1 1 0 0 0 .1. 
11110  0 0 


Word 

00000000 
10x01111 
01011110 
11  110001 
10111100 
00010011 
11100010 
0100.1101 
11010111 
01111000 
10001001 
00100110 
01101011 
11000100 
00110101 
10011010 


TABLE  VIII.  Message  words  and  their  correspondent 
code  word  for  the  (15,4)  cyclic  code 
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Channel 

3 

Probability  of  error  P{e) 

0.07050 

5.4480 

x 10~3 

0.09797 

2.9176 

x 10“2 

0.12426 

6.2425 

x 10’2 

0.13992 

1.2542 

x 10"1 

0.1709 

1.8780 

-1 

x 10  1 

0.26613 

4.9052 

x lo”1 

TABLE  IX.  P(e)  vs.  channel  B for  the  code  (15,4) 


Cetinyilmaz  [Ref.  21].  In  the  same  reference  a noise 
generating  program  is  presented  to  simulate  different 
conditional  probabilities  of  error  for  the  BSC.  The  same 
program  was  used  in  this  thesis  to  simulate  a noise  BSC 
and  to  test  the  effectiveness  of  the  code  implemented. 
Appendix  F gives  a listing  of  the  program. 

Having  the  enciphering  scheme,  the  error  correcting 
code  and  a mean  for  introducing  noise  into  the  message  to 
reflect  different  values  of  $ for  the  channel,  all  were 
combined  to  simulate  a Secure  Digital  Communication  System, 
as  depicted  by  Figure  27. 

The  following  is  the  complete  program  flow  for  the 
system: 
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a)  Input  program  (address  20000  to  20036)  - The  message 
is  typed  in.  The  program  stores  the  message  in  ASCII  code 
form  into  memory  locations  30002-32000  (16-bit  form) . 

b)  Data-keyed  cipher  program  (10000-11044)  - The  key 
to  be  used  is  typed  in,  the  program  stores  it  at  30000. 

The  program  takes  the  message  from  30000-32000,  ciphers  it 
and  then  stores  it  at  40000-42000  (16-bit  form) . The 
parameter  Ni"  can  be  selected  at  address  10014. 

c)  Input  interface  program  (14000-14036)  - This  program 
puts  the  ciphered  text,  already  in  16-bit  form,  into  8-bit 
form  to  be  handled  by  the  encoding  program.  8-bit  charac- 
ters are  moved  into  memory  locations  51000-52000. 

d)  Encoder  program  (14040-14152)  - Encodes  message  and 
stores  coded  words  into  memory  locations  52000-54000. 
Generator  matrix  is  stored  at 


Memory  location  Content 

50200  104656 

50202  46570 

50204  23274 

50206  11536 


e)  Noise  generating  program  (14540-14754) 

f)  Noise  mixing  program  (14756-15050)  - Takes  coded 
words  from  52000-54000  and  exclusive-ors  them  with  noise 
words  at  32000-34000,  thus  introducing  noise  into  the  text. 
Results  are  stored  back  at  52000-54000. 
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g)  Minimum  distance  decker  (14154-14436)  - Takes  the 
distorted  coded  words  from  location  52000-54000,  decodes 
them  if  they  are  correctible  and  stores  the  decoded  words 
at  location  56000-57000.  Check  polynomial  is  11536  at 
address  50104. 

h)  Output  interface  program  (14440-14464)  - Takes  decoded 
words  and  moves  them  to  30000-32000  to  be  deciphered. 

i)  Data-keyed  deciphering  program  (10000-11044)  - Same 
as  (b) , the  only  change  needed  is  to  chang j the  contents  of 
address  10012  from  40002  to  30002  to  be  compatible  with  the 
decipherment  process.  The  program  deciphers  the  message 
and  stores  the  results  in  memory  locations  40000-42000. 

j)  Output  program  (12000-12244)  - Prints  the  cryptogram 
and  the  plaintext  message. 
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VIII.  SUMMARY  AND  CONCLUSIONS 

f\ 

After  looking  at  the  computer  organization  and 
establishing  a basis  to  realize  reversible  transformations,  • 

three  cryptographic  systems  were  implemented: 

l 

1.  Simple  substitution 

2.  Pseudo-random  cipher  : 

3.  Data-keyed  cipher 

The  first,  provided  the  basis  for  the  other  two.  It 
was  not  intended  to  provide  any  significant  amount  of 
security  since  the  cryptanalytic  weakness  of  a simple 
substitution  is  well  known. 

The  pseudo-random  cipher  is  provided  with  a means  to 
do  polyalphabetic  substitutions.  This  kind  of  cipher  is 
known  to  be  time  cons  riding  when  done  manually.  The  algorithm 
U3ed  to  generate  pseudo- random  keys  was  a simple  one, 
though  it  can  be  as  complex  as  the  user  desires. 

With  the  data-keyed  cipher  very  significant  results 
were  obtained  in  the  sense  that  it?  distribution  plots 
were  fairly  flat.  A disadvantage  presented  by  this  cipher 
was  the  error  propagation  when  deciphering.  This  fact 
motivated  the  author  to  look  into  error  correcting  codes 
to  use  them  with  this  or  any  other  system.  A (15,4)  cyclic 
error  correcting  block  code  was  implemented.  This  code 
contributed  appreciably  to  reduce  the  probability  of  error, 
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P (fc) , when  transmission  was  simulated  over  a noisy  binary 
symmetric  channel. 

Finally,  it  can  be  said  that  the  digital  computer  is 
suitable  for  encrypting  and  coding  data  for  transmission, 
providing  at  the  same  time  many  different  alternatives  for 
both  functions.  With  the  advent  of  microprocessors  and 
with  communication  systems  tending  to  become  all  digital, 
it  is  certain  that  we  will  see  in  the  future  a computer 
performing  these  functions  together  with  many  more. 
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APPENDIX  A.  - PROGRAM  FOR  THE 


SIMPLE  SUBSTITUTION  CIPHER 


010800 

7005000 

010002 

7005882 

010004 

7085027 

010006 

7177568 

013010 

7105727 

018012 

7177558 

013014 

7188275 

019016 

7012700 

019020 

7177562 

010022 

7095002 

619024 

7020027 

019026 

7060260 

019020 

7100002 

019022 

7012702 

019024 

7080001 

019026 

7080416 

010040 

7920027 

010042 

7090200 

010044 

7100002 

019046 

7012702 

010050 

7090002 

010052 

7090410 

019954 

7020027 

010056 

7000229 

019060 

7100902 

019062 

7012702 

010064 

7000095 

013066 

7909402 

010070 

7012702 

010072 

7090097 

013074 

7005202 

010076 

7105727 

019109 

7177564 

013192 

7109275 

010184 

7110927 

019196 

7177566 

010110 

7095001 

019112 

7095027 

010114 

7177560 

013116 

i s =;  7 7 - 

i 4.  W V • .1 

0 19120 

e : £ a 

• * * • w w L1 

103 


’T*4  V< 4 1 


1 

> 


SIMPLE  SUBSTITUTION  PROGRAM. , . CONTINUATION 


018122  7100275 
010124  7012701 
010126  /177562 
010120  7122701 
010122  7000215 
010124  7001024 
010126  '105727 
010140  7177564 
010142  7100275 
010144  7110127 
010146  7177566 
010150  7012702 
010152  7080012 
019154  7105727 
010156  7177564 
010160  7100275 
010162  7112727 
010164  7000200 
010166  7177566 
010170  7077207 
910172  7105727 
010174  7177564 
010176  7100275 
01O200  7112727 
010202  7000212 
010204  7177566 
010206  7105727 
010210  7177564 
018212  7100275 
010214  7112727 
010216  7000212 
010220  7177566 
010222  7000127 
G10224  7001172 
016226  7922702 
010220  7009004 
010222  7100455 
010224  7022702 
010226  7000002 
010240  7100425 
010242  7020127 
019244  7090260 
010246  71000O2 
01025O  7012794 
910252  7090260 


104 


SIMPLE  SUBSTITUTION  PRQGRRM.  .. . CONTINUATION 


010254  7000520 
010256  7820127 
010260  7000200 
010262  7100002 
010264  7012704 
010266  7000260 
010270  7000512 
010272  7020127 
010274  7000220 
010276  7100002 
010200  7012704 
010202  7800260 
010204  7800504 
910206  7012704 
010210  7000260 
010212  7900501 
010214  7&20127 
010216  7900260 
010220  7100802 
010222  7012704 
010224  7000240 
010226  7080472 
010220  7028127 
010222  7800200 
010224  7100802 
010226  7012704 
010240  7000248 
0102*2  7000465 
010244  7020127 
010246  7000220 
010250  71000O2 
010252  7012704 
010254  7080240 
010256  7800457 
010260  7012704 
010262  700024O 
010264  7000454 
310266  7022702 
010270  7000086 
018272  7100425 
010274  7020127 
810276  7000268 
010408  7180002 
01O402  7012704 
010404  700OI20 
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SIMPLE  SUBSTITUTION  PROGRAM...  . CONTINUATION 


$ 


010406  7000442 
010410  7020127 
010412  7000200 
010414  7100002 
010416  7012704 
010420  7000220 
010422  7000425 
010424  7020127 
010426  7000220 
010420  7100002 
010422  7012704 
010424  7000220 
010427  7000427 
01044&  7012704 
010442  7000220 
010444  7000424 
010446  7020127 
019450  7000260 
010452  7100002 
010454  7012704 
010456  7000200 
010460  7000416 
O10462  7020127 
010464  7000T00 
010466  7100002 
010470  7012704 
010472  7000200 
010474  7000410 
010476  7020127 
010500  7000220 
010502  7100002 
010504  7012704 
010506  7000200 
010519  7000402 
010512  7012704 
010514  7000200 
010516  7074001 
010520  7074401 
010522  7105727 
010524  7177564 
010526  7100273 
010520  7110127 
010522  7177566 
010524  7005202 
010526  7020227 
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SIMPLE  SUBSTITUTION  PROGRRM.  ., CONTINUATION 


010549  7000050 
010542  7001036 
010544  7005002 
010546  710573? 
010550  7177564 
010552  7100375 
010554  711273? 
010556  7000215 
010560  7177566 
010562  7012702 
010564  7000012 
010566  710573? 
010576  7177564 
010572  7100375 
010574  711273? 
010576  7000200 
010600  7177566 
010602  707720? 
010604  710573? 
010606  7177564 
010610  71O0375 
010612  711273? 
010614  7000212 
010616  7177566 
010620  710573? 
010622  7177564 
010624  7100375 
010626  7112737 
010630  7000212 
010632  7177566 
910624  7005002 
010636  7005004 
010640  700016? 
010642  7177244 
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APPENDIX  B. - PROGRAM  TO  COUNT  THE  NUMBER 
OF  OCCURRENCES  OF  EACH  CHARACTER  IN  A MESSAGE 
5T0RED  AT  LOCATION  4O00B  AND  UP 


013080  7012704 
013002  7117700 
013004  7012702 
013806  7000240 
013010  7005003 
813012  7012701 
013014  7040000 
013016  7021127 
013020  7000215 
013022  7001404 
013024  7022102 
013026  7001373 
013020  7005202 
013022  7000771 
012034  7000240 
012026  7000240 
012040  7000240 
012042  7105727 
012044  7177564 
012046  7100275 
012050  7118227 
012052  7177566 
012054  7810224 
012056  7005202 
012060  7020227 
012062  7000240 
012064  7001251 
013066  7000000 
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APPENDIX  C. - PROGRAM  TO  COMPUTE  STATISTICS 
OF  MESSAGE 


10  BLKDEF  B0,  32,  1 
20  BLKDEF  Bl,  32,  0 
30  BLKDEF  B2,  32,  0 
40  BLKDEF  B3,  32,  1 

50  LET  B3,  0,  ' 0RBCDEF GH I JKLMNOPP.RSTUVWXV2I 
60  BIBSET  B0, 3,  11 

65  BIBSET  B0, 1, 15 

66  BIBSET  B3, 1, 15 
70  LINK  ' 110000', 11 
150  FLOAT  B0, Bl 
155  MOVE  Bl, B2 

160  I NTG  Bl 

170  LET  P 0 , Bl,  21 

171  MOVE  B2, Bl 

180  PRINT  'TOTAL  NUMBER  OF  OCCURRENCES=  ' , R0 
lei  PRINT  ' ' 

190  PRINT  'CHAR  NO.  OF  OCCURRENCES 

200  FOR  12,  0,  21 

210  LET  Rl, Bl,  12 

220  STACK  201,  200,  5,  100..  , 4,  254 

240  LET  Bl,  12,  R4 

250  TRANS  0,  B2,  12,  11 

260  HOLOUT  'KB' , 11, ' : ' 

270  LET  R2, Bl,  12 

271  LET  IS,  BO,  12 

288  PRINT  ' ',12 

282  NEXT  12 
285  PRINT  ' ' 

291  OSPEC  ' CR' 

292  DISPLV  Bl, ' M' , ' G' 

292  OSPEC  ' K8' 

200  LET  Rl- 22. 

210  MOVE  Bl, B2 

220  MUL  Bl, Bl 

220  I NTG  B2 

240  LET  R2,  B2,  21 

250  QUOT  R2, R2, Rl 

260  PRINT  'EXPECTED  VALUE  = ' , R2 

280  PROD  R2,  R2,  P2 

290  I NTG  Bl 

400  LET  R2, Bl, 21 

410  QUOT  R2, R 2 , Rl 

420  DIF  R2, R2, R2 

420  PRINT  ' VARIANCE  = ' , R2 

450  STACK  202, 16, 255 

469  PRINT  'STANDARD  DEVIATION  = ' , R5 

470  RETURN 
END 
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APPENDIX  D.  - PROGRAM  FOR  THE 
DATA-KEYED  CIPHER 


018000  7812727 
010002  7040002 
010004  7001006 
010006  7012727 
010010  7000007 
010012  700x312 
010014  7005027 
010016  7027770 
010020  7005000 
010022  7005002 
010824  7005027 
0i0026  7177560 
010020  7105727 
010022  7177560 
010024  7100275 
010026  7012700 
010040  7177562 
010042  7005002 
010044  7020027 
010046  7000260 
010050  7100002 
010052  7612702 
010054  7000001 
010056  7000416 
010060  7020027 
010062  7000200 
010064  7100002 
010066  7012702 
010070  7000002 
010072  7000410 
010074  7020027 
010076  7000220 
010100  7100002 
010102  7012702 
010104  7000005 
010106  7000402 
010110  7012702 
010112  7000007 
01O114  7005202 
010116  710572  7 
010120  7 1 7 7 5 6 4 


DATA-KEVED  PROGRAM. ,.  CONTINUATION 


010122  7100275 
010124  7118027 
010126  7177566 
010120  7010027 
010122  7028000 
010124  7010027 
010126  7040000 
010140  7812727 
010142  7020002 
010144  7001002 
010146  7012727 
010150  7040002 
010152  7001004 
010154  7005001 
010156  7005027 
010160  7177560 
610162  7105727 
010164  7177560 
010166  7100275 
010178  7012701 
010172  7177562 
010174  7012704 
010176  7001002 
010200  7010124 
010202  7010427 
010204  7001002 
010206  7005004 
010210  7022701 
010212  7000215 
010214  7091042 
010216  7012704 
010220  7901004 
010222  7010114 
010224  7105727 
010226  7177564 
010220  7100275 
010222  7110127 
010224  7177566 
010226  7012702 
010240  7000012 
810242  7185727 
010244  7177564 
010246  7100275 
010250  7112727 
010252  7000200 
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DATA-KEVED  PROGRAM.  . « CONTINUATION 


010254  7177566 
019256  7077207 
010260  7105727 
010262  7177564 
010264  7100275 
010266  7112727 
010270  7000212 
010272  7177566 
010274  7105727 
010276  7177564 
610200  7100275 
010202  7112727 
010204  7000212 
010206  7177566 
010210  7000127 
010212  7001172 
010214  7000240 
010216  7022702 
010226  7000004 
010222  7100455 
010224  7022702 
010226  7000002 
010220  7100425 
010222  7020127 
010224  7000260 
010226  7100002 
01024S  7012704 
010242  7000260 
010244  7900520 
010246  7020127 
010250  7000200 
010252  7100002 
010254  7012704 
010256  7000260 
010260  7000512 
010262  7020127 
010264  7000220 
010266  7100002 
010270  7012704 
010272  7000260 
010274  70O0504 
010276  7012704 
010400  7000260 
010402  70005O1 
010404  7020127 
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DATA-KEYED  PROGRAM.  ,.  CONTINUATION 


810406 
818416 
018412 
810414 
818416 
010426 
. 010422 
010424 
019426 
018420 
010422 
010424 
010426 
010446 
010442 
610444 
010446 
010450 
010452 
010454 
010456 
010460 
010462 
010464 
010466 
010420 
010472 
010474 
010476 
010500 
010502 
010504 
010506 
010510 
010512 
010514 
010516 
010520 
010522 
010524 
010526 
010526 
010522 
010524 
610526 


7000260 

7100003 

7012704 

7000240 

7008472 

702“12? 

7000200 

7100003 

7012704 

7000240 

7000465 

7020127 

7800220 

7160002 

7012704 

7000240 

7000457 

7012704 

7000240 

7000454 

7022702 

7060006 

7100425 

7020127 

7000260 

7100002 

7012704 

7000220 

7000442 

7620127 

7000200 

71O0002 

7012704 

7000220 

7000425 

7020127 

7000220 

7100002 

7012704 

7000220 

7000427 

7012764 

7000220 

7000424 

7020127 


1 

i 

l 

L 

['  ill 


DATA-KEVED  PROGRAM.  . . CONTINUATION 


010549  7000260 
010542  7100002 
010544  7012704 
910546  7000200 
010550  /000416 
010552  7028127 
810554  7000200 
010556  7188002 
010566  7012704 
010562  7000200 
610564  7000410 
010566  7020127 
010570  7000220 
010572  7100002 
010574  7012704 
010576  7000209 
010600  7000402 
010602  7012704 
010604  7000200 
010606  7074001 
010610  7074401 
010612  7022727 
010614  7001012 
010616  7027770 
010620  7100024 
010622  7012704 
010624  7001606 
010626  7012427 
010620  7001014 
010622  7010427 
010624  7001006 
010626  7012704 
010640  7000004 
010642  7106227 
010644  7001014 
010646  7077402 
010650  700O241 
010652  7012704 
GU0654  7800005 
010656  7106127 
010660  7801014 
01O662  7077402 
CH0664  7812704 
O10666  7B01014 
010670  7074401 
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DflTft-KEVED  PROGRAM.  . . CONTINUATION 


010672  7805004 
010574  7000240 
010676  7000240 
010700  7000240 
010702  7105727 
010704  7177564 
010706  7100275 
010710  7110127 
010712  7177566 
010714  7012704 
010716  7001004 
010720  7010124 
010722  7010427 
010724  7001004 
010725  7005227 
010720  7027770 
010722  7005202 
010724  7020227 
010726  7000050 
010740  7001026 
010742  7005002 
019744  7105727 
010746  7177564 
010750  7100275 
010752  7112727 
010754  7000215 
010756  7177566 
010760  7012702 
010762  7000012 
010764  7195727 
016766  7177564 
010770  7100275 
010772  7112727 
010774  7000200 
010776  7177566 
011000  7077207 
011002  7105727 
011004  7177564 
011006  7100275 
011010  7112727 
011012  7O00212 
O11014  7177566 
011016  7105727 
011028  7177564 
O11022  7100275 
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DATA-KEVED  PROGRAM. . . CONTINUATION 


011024  711272? 
011026  7000212 
011020  7177566 
011022  7005002 
011024  7005004 
011 026  7000167 
011040  7177112 
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APPENDIX  E. - ENCODING  PROGRAM  FOP  j 

THE  < 15,4  ) CVCLIC  CODE  1 


014040  7812700 
014042  7051000 
014044  7000240 
014046  /0O0240 
014050  7012702 
014052  7050109 
014054  7112027 
614056  7050140 
014060  7012702 
014062  7000002 
014064  7012704 
014066  7000004 
014070  7012705 
014072  7050200 
014074  7005027 
014076  7038142 
014100  7012501 
014102  7106227 
014104  7058140 
014106  7102002 
014110  7074127 
014112  7050142 
014114  7000240 
014116  7077410 
014120  7012727 
014122  7050142 
014124  7052009 
814126  7005227 
014120  7014124 
014122  7805227 
014124  7014124 
814126  7077226 
014140  7077222 
014142  7012727 
014144  7052000 
014146  7020210 
014158  7008127 
014152  7901172 


I 

4 
) 

5 

4 

| 
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014540 

014542 

014544 

014546 

014550 

014552 

014554 

014556 

014560 

014562 

014564 

014566 

014570 

014572 

014574 

014576 

014609 

014602 

014604 

014606 

014610 

014612 

014614 

014616 

014620 

014622 

014624 

014626 

014620 

014622 

014624 

014626 

014640 

014642 

014644 

014646 

014650 

014652 

014654 

014656 

014660 

014662 

014664 

014666 

014670 
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APPENDIX  F.  - NOISE  GENERATING  PROGRAM 


7012700 
7022000 
7012701 
7001000 
7005020 
7077102 
7000240 
7012700 
7057000 
7012746 
7012705 
7012746 
7000020 
7011667 
7000025 
7012704 
7 1 7 7 2 0 4 
7612714 
7010000 
7012627 
7177200 
7011467 
7060020 
7012701 
7177216 
7612702 
7000020 
7012624 
7012714 
7009401 
7014446 
7062716 
7000002 
7077207 
7005227 
7060000 
7061414 
7011614 
7005044 
7012711 
7177775 
7005724 
7042714 
70O0001 
7066014 
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NOISE  GENERATING  PROGRAM.  . , CONTINUATION 


014672  /012774 
014674  7000001 
014676  /80e000 
814700  7000750 
014702  7005026 
014704  7012700 
014706  7057000 
014710  7012781 
014712  7022000 
014714  7012702 
014716  7000177 
014720  7012702 
014722  7000020 
014724  7006220 
014726  7006011 
014720  7877202 
014722  7005721 
014724  7012702 
914726  7000005 
014740  7S06228 
014742  7806011 
814744  7077202 
014746  7085721 
014750  7077215 
014752  7000127 
014754  7901172 
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APPENDIX  G. - DECODING  PROGRAM  FOR 
THE  HINIHUH  DISTANCE  DECODER 


014154  7012700 
014156  7052000 
014160  701273? 
014162  7050100 
014164  7050102 
014166  706272? 
014170  7050100 
014172  7050102 
014174  7012701 
014176  7050104 
014200  7012702 
014202  7054000 
014204  7012704 
014206  700001? 
014210  7905027 
014212  7050116 
014214  7011005 
014216  7074105 
014220  7012702 
014222  70O0017 
014224  7006205 
014226  7005527 
014220' 7050116 
014222  0 7 7 2 0 4 
014224  7022727 
014226  7000004 
614240  7050116 
014242  7002010 
014244  7006201 
0:14246  7102402 
014250  7077421 
014252  7000407 
014254  7062701 
014256  7000002 
014260  7077425 
014262  7000402 
014264  7010122 
014256  7005720 
014270  7060402 
014272  7012722 
014274  7900000 


p s® 


120 


f DECODING  PROGRAM.  .. CONTINUATION 

J 


014276  7805720 
814200  /l €2727 
014202  7000801 
014204  7050102 
014206  7002226 
014210  7006240 
014212  7000240 
014214  7000240 
014216  7000240 
014220  7012700 
014222  7050100 
014224  7812701 
014226  7054001 
014220  7012702 
014222  7056008 
014224  7005002 
014226  7005004 
014240  7112102 
014242  7005201 
014244  7112104 
014246  7005201 
014250  7012705 
014252  7000005 
014254  7090241 
014256  7106102 
014360  7077502 
014262  7012705 
014364  7080004 
014266  7106202 
014370  707; 302 
014272  7012705 
014274  7000005 
014276  7000241 
014400  7106104 
014402  7077502 
014404  7012705 
014406  7000004 
014410  7106204 
014412  7077502 
014414  7012705 
014416  7000005 
014420  7000241 
014422  7106104 
014424  7077502 
014426  7074204 
014420  7110422 
014422  7077040 
014424  7800127 
014426  7001172 
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